Phytase variants

FIELD OF THE INVENTION 

This invention relates to variants of phytases, in
particular variants of ascomycete phytases and variants of
basidiomycete phytases, the corresponding cloned DNA sequences,
a method of producing such phytase variants, and the use thereof
for a number of industrial applications.

BACKGROUND OF THE INVENTION 

Phytic acid or myo-inositol 1,2,3,4,5,6-hexakis dihydrogen
phosphate (or for short myo-inositol hexakisphosphate) is the
primary source of inositol and the primary storage form of
phosphate in plant seeds. Phytin is a mixed potassium, magnesium
and calcium salt of inositol.
The phosphate moieties of phytic acid chelate divalent
and trivalent cations such as metal ions, i.a. the nutritionally
essential ions of calcium, iron, zinc and magnesium as well as
the trace minerals manganese, copper and molybdenum.

Phytic acid and its salts, phytates, are often not
metabolized, i.e. neither the phosphorus thereof, nor the
chelated metal ions are nutritionally available.

Accordingly, food and feed preparations need to be
supplemented with inorganic phosphate, and often the
nutritionally essential ions such as iron and calcium must
also be supplemented.

Still further, the phytate phosphorus passes through the
gastrointestinal tract of such animals and is excreted with the
manure, resulting in an undesirable phosphate pollution of the
environment, leading e.g. to eutrophication of the water
environment and extensive growth of algae.

Phytic acid or phytates, said terms being, unless
otherwise indicated, in the present context used synonymously or
at random, are degradable by phytases.

The production of phytases by plants as well as by
microorganisms has been reported. Amongst the microorganisms,
phytase producing bacteria as well as phytase producing fungi
are known.

There are several descriptions of phytase producing
filamentous fungi belonging to the fungal phylum of Ascomycota
(ascomycetes). In particular, there are several references to
phytase producing ascomycetes of the Aspergillus genus such as
Aspergillus terreus (Yamada et al., 1986, Agric. Biol. Chem.
322:1275-1282). Also, the cloning and expression of the phytase
gene from Aspergillus niger var. awamori has been described
(Piddington et al., 1993, Gene 133:55-62). EP 0420358 describes
the cloning and expression of a phytase of Aspergillus ficuum
(niger). EP 0684313 describes the cloning and expression of
phytases of the ascomycetes Aspergillus niger, Myceliophthora
thermophila and Aspergillus terreus. Still further, some partial
sequences of phytases of Aspergillus nidulans, Talaromyces
thermophilus, Aspergillus fumigatus and another strain of
Aspergillus terreus are given.

The cloning and expression of a phytase of Thermomyces
lanuginosus is described in WO 97/35017.

There is a current need for phytases of amended properties
or characteristics, e.g. phytases of increased thermostability,
altered pH optimum (a high pH optimum being desirable for
in-vitro processing, a low one for in-vivo processing in the
gastro-intestinal tract), and/or of a higher specific activity.

SUMMARY OF THE INVENTION

In a first aspect, the invention provides phytase
variants, the characteristics of which are amended as compared
to a so-called model phytase.

Any model phytase which is of a certain similarity to
thirteen herein specifically disclosed model phytases can be
made the model of such variants.

In another aspect, the invention relates to a novel
phytase derived from Cladorrhinum foecundissimum.

In still another aspect, the invention provides DNA
sequences encoding these phytase variants and this phytase, and
methods of their production.

Finally, the invention also relates generally to the use
of the phytase and the phytase variants for liberating
phosphorus from any phytase substrate, in particular inorganic
phosphate from phytate or phytic acid.

BRIEF DESCRIPTION OF THE DRAWINGS

In the detailed description of the invention below,
reference is made to the drawings, of which

Fig. 1 is an alignment of thirteen specific phytase
sequences (a multiple sequence alignment according
to the program PileUp; GapWeight: 3.000;
GapLengthWeight: 0.100);

Fig. 2 shows the amino acid and DNA sequence of
a first phytase ("P_involtus-A1") derived from
strain CBS 100231 of Paxillus involutus which was
deposited on 28.11.97; the expression plasmid pYES
2.0 comprising the full length cDNA sequence was
transformed into E. coli strain DSM 11842 which was
deposited on 12.11.97 (see WO 98/28409);

Fig. 3 shows the amino acid and DNA sequence of
a second phytase ("P_involtus-A2") derived from
strain CBS 100231 of Paxillus involutus which was
deposited on 28.11.97; the expression plasmid pYES
2.0 comprising the full length cDNA sequence was
transformed into E. coli strain DSM 11843 which was
deposited on 12.11.97 (see WO 98/28409);

Fig. 4 shows the amino acid and DNA sequence of
a phytase ("T_pubescens") derived from strain
CBS 100232 of Trametes pubescens, which was
deposited on 28.11.97; the expression plasmid pYES
2.0 comprising the full length cDNA sequence was
transformed into E. coli strain DSM 11844 which was
deposited on 12.11.97 (see WO 98/28409);

Fig. 5 shows the amino acid and DNA sequence of
a phytase ("A_pediades") derived from strain CBS
900.96 of Agrocybe pediades deposited on 04.12.96;
the expression plasmid pYES 2.0 comprising the full
length cDNA sequence was transformed into E. coli
strain DSM 11313 which was deposited on 02.12.96
(see WO 98/28409);

Fig. 6 shows the amino acid and DNA sequence of
a phytase ("P_lycii") derived from strain CBS 686.96
of Peniophora lycii which was deposited on 04.12.96;
the expression plasmid pYES 2.0 comprising the full
length cDNA sequence was transformed into E. coli
strain DSM 11312 which was deposited on 02.12.96
(see WO 98/28409);

Fig. 7 equals figure 2 of EP 0684313 and shows
the amino acid and DNA sequence of a phytase
("M_thermophila") derived from strain ATCC 48102
(=ATCC 74340) of Myceliophthora thermophila which
was re-deposited on 14.03.97;

Fig. 8 shows the amino acid and DNA sequence of
a phytase ("A_fumigatus") derived from strain ATCC
13073 of Aspergillus fumigatus (see EP 0897985);

Fig. 9 shows the amino acid ("Conphys") and DNA
sequence of an ascomycete consensus phytase (in the
present context called "consphyA") (see EP 0897985);

Fig. 10 shows the amino acid and DNA sequence of
a phytase ("A_nidulans") derived from strain
DSM 9743 of Aspergillus nidulans (see EP 0897985);

Fig. 11 equals figure 8 of EP 0420358 and shows
the amino acid and DNA sequence of a phytase
("A_ficuum") derived from Aspergillus ficuum strain
NRRL-3135;

Fig. 12 equals figure 1 of EP 0684313 and shows
the amino acid and DNA sequence of a phytase
("A_terreus") derived from strain CBS 220.95 of
Aspergillus terreus;

Fig. 13 shows the amino acid and DNA sequence of
a phytase ("T_thermo") derived from strain ATCC
20186 (=ATCC 74338) of Talaromyces thermophilus
which was redeposited on 14.03.97 (see EP 0897985);

Fig. 14 equals figure 2 of WO 97/35017 and shows
the amino acid and DNA sequence of a phytase
("T_lanuginosa") derived from strain CBS 586.94 of
Thermomyces lanuginosus; a plasmid comprising the
full length cDNA sequence was transformed into
E. coli DH5α (pMWR46) strain B-21527 which was
deposited with NRRL on 23.02.96;

Fig. 15 shows the amino acid and DNA sequence of
a phytase ("C_foecundissimum") derived from strain
CBS 427.97 of Cladorrhinum foecundissimum which was
deposited on 23 January 1997; the expression plasmid
pYES 2.0 comprising the full length cDNA sequence
was transformed into E. coli strain DSM 12742 which
was deposited on 17 March 1999;

Fig. 16 shows an alignment of the phytase
C_foecundissimum with the model phytase
M_thermophila, using the program GAP gcg (Gap Weight
3.000; Length Weight 0.100); and

Fig. 17 shows how the C_foecundissimum phytase can be pasted
onto the alignment of Fig. 1.

DETAILED DISCLOSURE OF THE INVENTION

Phytase 

In the present context, a phytase is an enzyme which catalyzes
the hydrolysis of phytate (myo-inositol hexakisphosphate) to
(1) myo-inositol and/or (2) mono-, di-, tri-, tetra- and/or
penta-phosphates thereof and (3) inorganic phosphate. In the
following, for short, the above compounds are sometimes
referred to as IP6, I, IP1, IP2, IP3, IP4, IP5 and P,
respectively. This means that by action of a phytase, IP6 is
degraded into P + one or more of the components IP5, IP4, IP3,
IP2, IP1 and I. Alternatively, myo-inositol carrying in total n
phosphate groups attached to positions p, q, r, ... is denoted
Ins(p,q,r,...)Pn. For convenience Ins(1,2,3,4,5,6)P6 (phytic
acid) is abbreviated PA.

According to the Enzyme nomenclature database ExPASy (a
repository of information relative to the nomenclature of
enzymes primarily based on the recommendations of the
Nomenclature Committee of the International Union of
Biochemistry and Molecular Biology (IUBMB) describing each type
of characterized enzyme for which an EC (Enzyme Commission)
number has been provided), two different types of phytases are
known: A so-called 3-phytase (myo-inositol hexaphosphate
3-phosphohydrolase, EC 3.1.3.8) and a so-called 6-phytase
(myo-inositol hexaphosphate 6-phosphohydrolase, EC 3.1.3.26).
The 3-phytase hydrolyses first the ester bond at the
D-3-position, whereas the 6-phytase hydrolyses first the ester
bond at the D-6- or L-6-position.

The expression "phytase" or "polypeptide or enzyme
exhibiting phytase activity" is intended to cover any enzyme
capable of effecting the liberation of inorganic phosphate or
phosphorus from various myo-inositol phosphates. Examples of
such myo-inositol phosphates (phytase substrates) are phytic
acid and any salt thereof, e.g. sodium phytate or potassium
phytate or mixed salts. Also any stereoisomer of the mono-, di-,
tri-, tetra- or penta-phosphates of myo-inositol might serve as
a phytase substrate. A preferred phytase substrate is phytic
acid and salts thereof.

In accordance with the above definition, the phytase
activity can be determined using any assay in which one of these
substrates is used. In the present context (unless otherwise
specified) the phytase activity is determined in the unit of
FYT, one FYT being the amount of enzyme that liberates 1 µmol
inorganic ortho-phosphate per min. under the following
conditions: pH 5.5; temperature 37 °C; substrate: sodium phytate
(C6H6O24P6Na12) in a concentration of 0.0050 mol/l. A suitable
phytase assay is described in the experimental part.
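
As a worked illustration of the unit definition, the following
minimal sketch converts a measured phosphate release into FYT.
The function names, the sample volume and the dilution factor
are hypothetical and are not taken from the assay of the
experimental part; the sketch merely assumes that the
measurement was made under the conditions stated above (pH 5.5;
37 °C; 0.0050 mol/l sodium phytate).

```python
def fyt_units(phosphate_umol: float, minutes: float) -> float:
    """Activity in FYT: umol inorganic ortho-phosphate liberated per minute.

    Assumes (not checked here) that the release was measured at pH 5.5,
    37 degrees C, with 0.0050 mol/l sodium phytate as substrate.
    """
    if minutes <= 0:
        raise ValueError("incubation time must be positive")
    return phosphate_umol / minutes


def fyt_per_ml(phosphate_umol: float, minutes: float,
               sample_ml: float, dilution: float = 1.0) -> float:
    """Activity per ml of the undiluted enzyme sample (hypothetical helper)."""
    if sample_ml <= 0:
        raise ValueError("sample volume must be positive")
    return fyt_units(phosphate_umol, minutes) * dilution / sample_ml
```

For instance, 30 µmol of phosphate liberated in 30 min
corresponds to 1 FYT in the assayed aliquot.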

The present invention provides a genetically engineered
phytase as described in the appended claims.

A genetically engineered phytase is a non-naturally
occurring phytase which is different from a model phytase, e.g.
a wild-type phytase. Genetically engineered phytases include,
but are not limited to, phytases prepared by site-directed
mutagenesis, gene shuffling, random mutagenesis etc.

The invention also provides DNA constructs, vectors, host
cells, and methods of producing these genetically engineered
phytases and phytase variants, as well as uses thereof.

A phytase variant is a polypeptide or enzyme or a fragment 
thereof which exhibits phytase activity and which is amended as 
compared to a model phytase. 

Amended means altered by way of one or more amino acid or
peptide substitutions, deletions, insertions and/or additions,
in each case by, or of, one or more amino acids. Such
substitutions, deletions, insertions and additions can be
achieved by any method known in the art, e.g. gene shuffling,
random mutagenesis, site-directed mutagenesis etc.

The model or parent phytase, from which the phytase
variant is derived, can be any phytase, e.g. a wild-type phytase
or a derivative, mutant or variant thereof, including allelic
and species variants, as well as genetically engineered variants
thereof, which e.g. can be prepared by site-directed
mutagenesis, random mutagenesis, shuffling etc.

Included in the concept of model phytase is also any 
hybrid or chimeric phytase, i.e. a phytase which comprises a 
combination of partial amino acid sequences derived from at 
least two phytases. 

The hybrid phytase may comprise a combination of partial
amino acid sequences deriving from at least two ascomycete
phytases, at least two basidiomycete phytases or from at least
one ascomycete and at least one basidiomycete phytase. These
ascomycete and basidiomycete phytases from which a partial amino
acid sequence derives may, e.g., be any of those specific
phytases referred to herein.

In the present context, a hybrid, shuffled, random
mutagenised, site-directed mutagenised or otherwise genetically
engineered phytase derived from ascomycete phytases only is also
an ascomycete phytase; and a hybrid, shuffled, random
mutagenised, site-directed mutagenised or otherwise genetically
engineered phytase derived from model basidiomycete phytases
only is also a basidiomycete phytase. Any hybrid derived from at
least one ascomycete phytase as well as at least one
basidiomycete phytase is called a mixed ascomycete/basidiomycete
phytase and such phytase is also a model phytase in the present
context.

Analogously, a hybrid, shuffled, random mutagenised,
site-directed mutagenised or otherwise genetically engineered
phytase derived from one or more Aspergillus phytases is also an
Aspergillus derived phytase; and a hybrid, shuffled, random
mutagenised, site-directed mutagenised or otherwise genetically
engineered phytase derived from any other taxonomic sub-grouping
mentioned herein is also to be designated a phytase derived from
this taxonomic sub-grouping.

Still further, in the present context, "derived from" is
intended to indicate a phytase produced or producible by a
strain of the organism in question, but also a phytase encoded
by a DNA sequence isolated from such strain and produced in a
host organism transformed with said DNA sequence. Finally, the
term is intended to indicate a phytase which is encoded by a DNA
sequence of synthetic and/or cDNA origin and which has the
identifying characteristics of the phytase in question.

Preferably the model phytase is a phytase which can be
aligned as described below to any of the thirteen phytases of
Fig. 1 (which are particularly preferred model phytases).

Preferred wild-type model phytases (i.e. neither
recombinant, nor shuffled, nor otherwise genetically engineered
phytases) have a degree of similarity or homology, preferably
identity, to amino acid sequence no. 38-403 (Peniophora numbers)
of any of these thirteen phytases of at least 40%, more
preferably at least 50%, still more preferably at least 60%, in
particular at least 70%, especially at least 80%, and in a most
preferred embodiment a degree of similarity of at least 90%.

Preferred recombinant or shuffled or otherwise genetically
engineered model phytases have a degree of similarity or
homology, preferably identity, to partial sequence no. 38-49,
63-77, 274-291, 281-300 and 389-403 (Peniophora numbers) of
any of these thirteen phytases of at least 60%, more
preferably at least 70%, still more preferably at least 80%, in
particular at least 90%.

In a preferred embodiment the degree of similarity is
based on a comparison with the complete amino acid sequence of
any of the thirteen phytases.

The degree of similarity or homology, alternatively
identity, can be determined using any alignment programme known
in the art. A preferred alignment programme is GAP provided in
the GCG version 8 program package (Program Manual for the
Wisconsin Package, Version 8, August 1994, Genetics Computer
Group, 575 Science Drive, Madison, Wisconsin, USA 53711) (see
also Needleman, S.B. and Wunsch, C.D., (1970), Journal of
Molecular Biology, 48, 443-453). GAP is used with the following
settings for polypeptide sequence comparison: GAP weight of
3.000 and GAP lengthweight of 0.100.
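
To make the notion of a degree of identity concrete, the
following minimal sketch performs a global alignment in the
manner of Needleman and Wunsch and reports the percentage of
identical aligned residues. It is not the GAP program: it
assumes a simple match/mismatch score and a linear gap penalty
instead of the GCG scoring matrix and the gap weights given
above, and it takes the alignment length as the denominator,
which is likewise an assumption. All names are hypothetical.

```python
def needleman_wunsch(a: str, b: str, match: int = 1,
                     mismatch: int = -1, gap: int = -1):
    """Globally align a and b; return the two gapped strings."""
    n, m = len(a), len(b)
    # score[i][j] = best score for aligning a[:i] with b[:j]
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one optimum.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1])
            i -= 1; j -= 1
        elif i > 0 and (j == 0 or score[i][j] == score[i - 1][j] + gap):
            out_a.append(a[i - 1]); out_b.append("-")
            i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Identical, non-gap columns as a percentage of alignment length."""
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("aligned sequences must be non-empty and equal in length")
    same = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * same / len(aligned_a)
```

Aligning the toy sequences "ACDEF" and "ACEF" in this way gives
four identical residues over five alignment columns.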

Also preferred is a wild-type model phytase which
comprises an amino acid sequence encoded by a DNA sequence which
hybridizes to a DNA sequence encoding amino acid sequence 38-403
(Peniophora numbers) of any of the DNA sequences encoding the
thirteen specific phytase sequences of Fig. 1.

A further preferred model phytase is a genetically
engineered phytase, which comprises an amino acid sequence
encoded by a DNA sequence which hybridizes to a DNA sequence
encoding amino acid sequence 38-49, and to a DNA sequence
encoding amino acid sequence 63-77, and to a DNA sequence
encoding amino acid sequence 274-291, and to a DNA sequence
encoding amino acid sequence 281-300, and to a DNA sequence
encoding amino acid sequence 389-403 (Peniophora numbers) of any
of the DNA sequences encoding the thirteen specific phytase
sequences of Fig. 1.

In a preferred embodiment the hybridization is to the
complete phytase encoding part of any of the thirteen phytases.

Suitable experimental conditions for determining whether a
given DNA or RNA sequence "hybridizes" to a specified nucleotide
or oligonucleotide probe involve presoaking of the filter
containing the DNA fragments or RNA to examine for hybridization
in 5 x SSC (sodium chloride/sodium citrate; J. Sambrook, E.F.
Fritsch, and T. Maniatis, 1989, Molecular Cloning, A Laboratory
Manual, 2nd edition, Cold Spring Harbor, New York) for 10 min,
and prehybridization of the filter in a solution of 5 x SSC, 5 x
Denhardt's solution (Sambrook et al. 1989), 0.5 % SDS and 100
µg/ml of denatured sonicated salmon sperm DNA (Sambrook et al.
1989), followed by hybridization in the same solution containing
a concentration of 10 ng/ml of a random-primed (Feinberg, A.P.
and Vogelstein, B. (1983) Anal. Biochem. 132:6-13), 32P-dCTP-
labeled (specific activity > 1 x 10^9 cpm/µg) probe for 12 hours
at approximately 45 °C.

The filter is then washed twice for 30 minutes in 2 x SSC,
0.5 % SDS at a temperature of at least 55°C (low stringency),
at least 60°C (medium stringency), at least 65°C (medium/high
stringency), at least 70°C (high stringency), or at least 75°C
(very high stringency).
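
The wash temperature thresholds just listed can be restated as
a small lookup, sketched below under the assumption that only
the wash temperature varies and that the highest level whose
threshold is met is the one of interest. The names are
hypothetical.

```python
# (minimum wash temperature in degrees C, stringency label), highest first
STRINGENCY_LEVELS = [
    (75.0, "very high"),
    (70.0, "high"),
    (65.0, "medium/high"),
    (60.0, "medium"),
    (55.0, "low"),
]


def stringency_for_wash(temp_c: float):
    """Return the highest stringency level met by a wash at temp_c.

    Returns None when the temperature is below the 55 degrees C
    threshold for low stringency.
    """
    for threshold, label in STRINGENCY_LEVELS:
        if temp_c >= threshold:
            return label
    return None
```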

Molecules to which the oligonucleotide probe hybridizes 
under these conditions are detected using an x-ray film. 

It should be noted that a certain specific phytase variant
need not actually have been prepared from a specific model
phytase, for this model phytase to qualify as a "model phytase"
in the present context. It is sufficient that the variant
exhibits at least one of the herein indicated amendments when it
is afterwards compared with the model phytase.

The alignment of Fig. 1 is made using the program PileUp
(Program Manual for the Wisconsin Package, Version 8, August
1994, Genetics Computer Group, 575 Science Drive, Madison,
Wisconsin, USA 53711) [...] a new model phytase or a new
phytase variant [...] together with the new phytase [...] or,
alternatively, at least [...] alignment.

[...] model phytase is as follows: The new model phytase [...]
of Fig. 1 to which the model phytase has [...] homology, and
for making the "alignment according to Fig. 1" of the two
sequences the [...]. Preferably [...] (or phytase variant) is
[...] (pasted) to the alignment at Fig. 1 using the result of
the first alignment (placing identical and homologous amino
acid residues above each other as prescribed [...]), following
which corresponding [...].

Example 7 shows an example of how to add a [...] phytase to the
alignment of Fig. 1 and [...] phytase variants thereof.

Other model phytases can be aligned and variants deduced
in analogy with Example 7. This is [...] the following model
phytases: The [...] awamori (US patent [...]); the [...]
phytase of WO 98/[...]; the soy bean phytase of WO [...]; the
[...] phytase of WO 98/05785; the Aspergillus phytase of
WO 97/38096; the phytases of Monascus anka of WO 98/13480; the
phytase from Schwanniomyces occidentalis of EP 0699762 etc.

When comparing a model phytase and a proposed phytase
variant using the alignment as described herein, corresponding
amino acid positions can be identified, viz. a model position of
the model phytase and a variant position of the variant - the
corresponding model position and variant position are simply
placed one above the other in the alignment. An amendment is
said to have occurred in a given position if the model amino
acid of the model position and the variant amino acid of the
variant position are different. Preferred amendments of these
positions manifest themselves as amino acid substitutions,
deletions or additions.

Amended in at least one position means amended in one or
more positions, i.e. in one, two, three, four, five, six, seven,
eight, nine, ten, eleven, twelve etc. up to all N positions
listed. This definition includes any possible sub-combinations
thereof, e.g. any set of two substitutions, any set of three,
any set of four, etc. - to any set of (N-1) positions.

In the present context all sequences, whatever the model
phytase, and including the thirteen sequences of Fig. 1, are
numbered using the numbering corresponding to the phytase
P_lycii. These "Peniophora numbers" are indicated at Fig. 1,
together with the "alignment numbers." The numbering of P_lycii
starts at M1 and ends at E439.

As explained above, the alignment reveals which positions
in various phytase sequences other than P_lycii are equivalent
or corresponding to the given P_lycii position.

A substitution of amino acids is indicated herein as for
instance "3S," which indicates that at position 3 amino acid S
should be substituted for the "original" or model position 3
amino acid, whichever it is. Thus, the substitution should
result in an S in the corresponding variant position.
Considering now the alignment at Fig. 1, a substitution like
e.g. "3S" is to be interpreted as follows, for the respective
phytases shown (the amino acid first indicated is the "original"
or model amino acid in "Peniophora position" 3):

P_involtus_A1:        F3S (number 3 F substituted by S)
P_involtus_A2:        L3S
T_pubescens:          M1S
A_pediades:           M1S
P_lycii:              redundant (already an S)
A_fumigatus:          T5S
consphyA:             V5S
A_nidulans:           T5S
A_ficuum_NRRL3135:    A5S
A_terreus:            A5S
T_thermo:             L5S
T_lanuginosa:         V11S
M_thermophila:        G5S

However, in what follows the above specific substitutions
will be designated as follows (always using the Peniophora
numbering):

P_involtus_A1:        F3S
P_involtus_A2:        L3S
T_pubescens:          M3S
A_pediades:           M3S
P_lycii:              redundant (already an S)
A_fumigatus:          T3S
consphyA:             V3S
A_nidulans:           T3S
A_ficuum_NRRL3135:    A3S
A_terreus:            A3S
T_thermo:             L3S
T_lanuginosa:         V3S
M_thermophila:        G3S
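
The relation between a sequence's own numbering and the
Peniophora numbering can be sketched as follows. The two model
rows below are short hypothetical stand-ins, not rows of
Fig. 1; they are merely constructed so that the model residue
aligned to Peniophora position 3 sits at own position 5 and own
position 1, respectively. Function names are likewise
hypothetical.

```python
def map_positions(ref_row: str, model_row: str, gap: str = "-") -> dict:
    """Map reference positions to (own position, residue) of the model.

    Both arguments are rows of the same alignment; only columns in
    which neither row has a gap are recorded.
    """
    if len(ref_row) != len(model_row):
        raise ValueError("alignment rows must have equal length")
    mapping = {}
    ref_pos = model_pos = 0
    for r, m in zip(ref_row, model_row):
        if r != gap:
            ref_pos += 1
        if m != gap:
            model_pos += 1
        if r != gap and m != gap:
            mapping[ref_pos] = (model_pos, m)
    return mapping


def describe_substitution(ref_row: str, model_row: str,
                          ref_pos: int, new_aa: str):
    """Return (own-numbering, reference-numbering) designations.

    Returns None when the model already carries new_aa at that
    position, i.e. when the substitution is redundant. Raises
    KeyError when the model has a gap at the reference position.
    """
    own_pos, old_aa = map_positions(ref_row, model_row)[ref_pos]
    if old_aa == new_aa:
        return None
    return f"{old_aa}{own_pos}{new_aa}", f"{old_aa}{ref_pos}{new_aa}"


# Hypothetical toy rows (NOT taken from Fig. 1)
REFERENCE = "--MVSLA"   # reference row; S is at reference position 3
MODEL_A = "MVTLTFL"     # two extra residues before the reference start
MODEL_B = "----MAF"     # starts two positions after the reference start
```

With these toy rows, the substitution "3S" corresponds to T5S
in the own numbering of MODEL_A and to M1S in that of MODEL_B,
whereas both are written T3S and M3S in the reference
numbering.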



Still further, denotations like e.g. "3S,F,G" mean that
the amino acid in position 3 (Peniophora numbers) of the model
phytase in question is substituted with either of S, F or G,
i.e. e.g. the designation "3S,F,G" is considered fully
equivalent to the designation "3S, 3F, 3G".
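
The equivalence just stated can be sketched as a small parser
that expands the shorthand into its individual designations.
The function name and the regular expression are hypothetical;
the sketch covers substitutions only, not the addition and
deletion notations, and it accepts an optional lower-case
letter after the position number (as in "203aV,T").

```python
import re

# position number, optional lower-case letter, then comma-separated residues
_SHORTHAND = re.compile(r"^(\d+[a-z]?)([A-Z](?:,[A-Z])*)$")


def expand_substitutions(token: str) -> list:
    """Expand e.g. "3S,F,G" into ["3S", "3F", "3G"]."""
    match = _SHORTHAND.match(token.replace(" ", ""))
    if match is None:
        raise ValueError(f"not a substitution designation: {token!r}")
    position, residues = match.groups()
    return [f"{position}{aa}" for aa in residues.split(",")]
```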

A denotation like ()3S means that amino acid S is added to
the sequence in question (at a gap in the actual sequence), in a
position corresponding to Peniophora number 3 - and vice versa
for deletions (S3()).

In case of regions in which the Peniophora phytase
sequence has larger deletions than some of the other phytases in
Fig. 1, for instance in the region between position 201 and 202
(Peniophora numbers), intermediate positions (amino acid
residues in other sequences) are numbered by adding a, b, c, d,
etc, in lower-case letters, to the last Peniophora position
number, e.g. for the phytase M_thermophila: E201; G201a; P201b;
Y201c; S201d; T201e; I201f; G202; D203 etc.
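
The lettering of intermediate positions can be sketched as
below. The model row reproduces the M_thermophila residues
quoted above; the reference row uses placeholder residues
("A"), since the actual Peniophora residues of this region are
not given here. Letters are assigned per alignment column,
which is an assumption, and the function name is hypothetical.

```python
from string import ascii_lowercase


def label_model_residues(ref_row: str, model_row: str,
                         ref_start: int = 1, gap: str = "-") -> list:
    """Label each model residue with the reference position number.

    Columns in which the reference row has a gap receive the number
    of the last reference position plus a, b, c, ... in column order.
    Supports at most 26 consecutive gap columns; gap columns before
    the first reference residue are not handled specially.
    """
    if len(ref_row) != len(model_row):
        raise ValueError("alignment rows must have equal length")
    labels = []
    ref_pos = ref_start - 1
    inserted = 0  # number of gap columns since the last reference residue
    for r, m in zip(ref_row, model_row):
        if r != gap:
            ref_pos += 1
            inserted = 0
            suffix = ""
        else:
            suffix = ascii_lowercase[inserted]
            inserted += 1
        if m != gap:
            labels.append(f"{m}{ref_pos}{suffix}")
    return labels


# Reference residues are placeholders; model residues as quoted in the text.
REF_SEGMENT = "A------AA"
MODEL_SEGMENT = "EGPYSTIGD"
```

Calling label_model_residues(REF_SEGMENT, MODEL_SEGMENT,
ref_start=201) yields the designations E201, G201a, P201b,
Y201c, S201d, T201e, I201f, G202 and D203.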
In one of the priority applications of the present
application there are two minor position numbering errors:
According to the above definitions, the positions referred to in
the first priority application as 204 and 205 (Peniophora
numbering) are wrongly designated; they should have been
numbered [...] and 204, respectively. Therefore, 204 has been
substituted by [...] and 205 by 204 throughout the present
application.

A preferred phytase variant of the invention comprises an
amino acid sequence which comprises, preferably contains, one or
more of the following amino acid substitutions:

24C; 27P; 31Y; 33C; 39H,S,Q; 40L,N; 42S,G;
43A,C,D,E,F,G,H,I,K,L,M,N,P,Q,R,S,T,V,W,Y; 44N; 45D,S; 47Y,F;
49P; 51E,A,R; 56P; 58D,K,A; 59G; 61R; 62V,I; 69Q; 75W,F; 78D,S;
79G; 80K,A; 81A,G,Q,E; 82T; 83A,I,K,R,Q; 84I,Y,Q,V; 88I; 90R,A;
102Y; 115N; 116S; 118V,L; 119E; 120L; 122A; 123N,Q,T; 125M,S;
126H,S,V; 127Q,E,N; 128A,S,T; 132F,I,L; 143N; 148V,I; 151A,S;
152G; 153D,Y; 154D,Q,S,G; 157V; 158D,A; 159T; 160A,S; 161T,N;
162N; 163W; 170fH; 170gA; 171N; 172P; 173Q,S; 184Q,S,P; 185S;
186A,E,P; 187A; 187aS; 190A,P; 193S; 194S,T; 195T,V,L; 198A,N,V;
200G,V; 201D,E; a deletion of at least one of 201a, 201b, 201c,
201d, 201e, 201f, preferably all; 201eT; 202S,A; 203R,K,S;
203aV,T; 204Q,E,S,A,V; 205E; 211L,V; 215A,P; 220L,N; 223H,D;
228N; 232T; 233E; 235Y,L,T; 236Y,N; 237F; 238L,M; 242P,S; 244D;
246V; 251eE,Q; 253P; 256D; 260A,H; 264R,I; 265A,Q; 267D;
270Y,A,L,G; 271D,N; 273D,K; 275F,Y; 278T,H; 280A,P; 283P;
287A,T; 288L,I,F; 292F,Y; 293A,V; 302R,H; 304P,A; 332F; 336S;
337T,G,Q,S; 338I; 339V,I; 340P,A; 343A,S,F,I,L; 348Y; 349P;
352K; 360R; 362P; 364W,F; 365V,L,A,S; 366D,S,V; 367A,K; 368K;
369I,L; 370V; 373A,S; 374S,A; 375H; 376M; 383kQ,E; 387P; 393V;
396R; 404A,G; 409R; 411K,T; 412R; 417E,R; 421F,Y; 431E.

In a preferred embodiment this is with the proviso that
the model phytase does not already comprise the above suggested
amino acid substitution or addition or deletion at the position
indicated. Or, with the proviso that, for each position, the
model amino acid is not already the variant amino acid hereby
proposed. But these provisos can be said to be in fact already
inherent in the above wording, because of the expression
"amended."

The various preferred phytase variants of claims 16-34
comprise, preferably contain or have, amino acid sequences
which comprise or contain one or more of the amino acid
substitutions, additions, or deletions listed in the respective
claims.

In a preferred embodiment the various phytase variants
comprise 1, 2, 3, 4, 5, 6, 7, 8, 9 or even 10 of these
substitutions; or a number of substitutions of 10-15, 15-20,
20-30 or even 30-50; eventually up to 60, 70, 80 or 90
substitutions.

In another preferred embodiment, the amino acid sequence
of the various phytase variants comprises one or more
substitutions of the substitution sub-groupings listed
hereinbelow; or combinations of substitutions classified in two
or more sub-groupings.

Generally, instead of "comprise," "contain" or "have," the
amino acid sequences of preferred variants "consist essentially
of" or "consist of" the specific model phytases of Fig. 1, as
modified by one or more of the substitutions described herein.

In the present context a basidiomycete means a
microorganism of the phylum Basidiomycota. This phylum of
Basidiomycota is comprised in the fungal kingdom together with
e.g. the phylum Ascomycota ("ascomycetes").

Taxonomical questions can be clarified by consulting the
references listed below or by consulting a fungal taxonomy
database (NIH Data Base (Entrez)) which is available via the
Internet on World Wide Web at the following address:
http://www3.ncbi.nlm.nih.gov/Taxonomy/tax.html.

For a definition of basidiomycetes, reference is made to
either Julich, 1981, Higher Taxa of Basidiomycetes; Ainsworth &
Bisby's (eds.) Dictionary of the Fungi, 1995, Hawksworth, D.L.,
P.M. Kirk, B.C. Sutton & D.N. Pegler; or Hansen & Knudsen
(Eds.), Nordic Macromycetes, vol. 2 (1992) and 3 (1997). A
preferred reference is Hansen & Knudsen.

For a definition of ascomycetes, reference is made to
either of Ainsworth & Bisby cited above, or [...]
by Eriksson, O.E. & D.L. Hawksworth, Vol. 16, 1998. A preferred
reference is Eriksson et al.

Generally, a microorganism which is classified as a
basidiomycete/ascomycete in either of the references listed
above, including the database, is a basidiomycete/ascomycete in
the present context.

Some Aspergillus strains are difficult to classify because
they are anamorphous, and therefore they might be classified in
Fungi imperfecti. However, once the teleomorphous counterpart is
found, it is re-classified taxonomically. For instance, the
teleomorph of A. nidulans is Emericella nidulans, of the family
Trichocomaceae, the order Eurotiales, the class Plectomycetes of
the phylum Ascomycota. These subgroupings of Ascomycota are
preferred, together with the family Lasiosphaeriaceae, the order
Sordariales, the class Pyrenomycetes of the phylum Ascomycota.

The wording "ascomycetes" and analogues as used herein
includes any strains of Aspergillus, Thermomyces,
Myceliophthora, Talaromyces which are anamorphous and thus would
be classified in Fungi imperfecti.

Preferred basidiomycete phytases are those listed in
WO 98/28409, in the very beginning of the section headed
"Detailed description of the invention".

DNA sequences encoding the thirteen specifically listed
model phytases and other model phytases can be prepared
according to the teachings of each of the documents listed under
the brief description of the drawings.

A DNA sequence encoding a model phytase may be isolated
from any cell or microorganism producing the phytase in
question, using various methods well known in the art. First, a
genomic DNA and/or cDNA library should be constructed using
chromosomal DNA or messenger RNA from the organism that produces
the phytase. Then, if the amino acid sequence of the phytase is
known, homologous, labelled oligonucleotide probes may be
synthesized and used to identify phytase-encoding clones from a
genomic library prepared from the organism in question.
Alternatively, a labelled oligonucleotide probe containing
sequences homologous to a known phytase gene could be used as a
probe to identify phytase-encoding clones, using hybridization
and washing conditions of lower stringency.

Yet another method for identifying phytase-encoding clones
would involve inserting fragments of genomic DNA into an
expression vector, such as a plasmid, transforming
phytase-negative bacteria with the resulting genomic DNA
library, and then plating the transformed bacteria onto agar
containing a substrate for phytase, thereby allowing clones
expressing the phytase to be identified.

Alternatively, the DNA sequence encoding the enzyme may be
prepared synthetically by established standard methods, e.g. the
phosphoramidite method described by S.L. Beaucage and M.H.
Caruthers (1981) or the method described by Matthes et al.
(1984). In the phosphoramidite method, oligonucleotides are
synthesized, e.g. in an automatic DNA synthesizer, purified,
annealed, ligated and cloned in appropriate vectors.

Finally, the DNA sequence may be of mixed genomic and
synthetic origin, mixed synthetic and cDNA origin or mixed
genomic and cDNA origin, prepared by ligating fragments of
synthetic, genomic or cDNA origin (as appropriate, the fragments
corresponding to various parts of the entire DNA sequence), in
accordance with standard techniques. The DNA sequence may also
be prepared by polymerase chain reaction (PCR) using specific
primers, for instance as described in US 4,683,202 or R.K. Saiki
et al. (1988).

DNA encoding the phytase variants of the present invention
can be prepared by methods known in the art, such as
Site-directed Mutagenesis. Once a DNA sequence encoding a model
phytase of interest has been isolated, and desirable sites for
mutation identified, mutations may be introduced using synthetic
oligonucleotides. These oligonucleotides contain nucleotide
sequences flanking the desired mutation sites; mutant
nucleotides are inserted during oligonucleotide synthesis. In a
specific method, a single-stranded gap of DNA, bridging the
phytase-encoding sequence, is created in a vector carrying the
phytase-encoding gene. Then the synthetic nucleotide, bearing
the desired mutation, is annealed to a homologous portion of the
single-stranded DNA. The remaining gap is then filled in with
DNA polymerase I (Klenow fragment) and the construct is ligated
using T4 ligase. A specific example of this method is described
in Morinaga et al. (1984). US 4,760,025 discloses the
introduction of oligonucleotides encoding multiple mutations by
performing minor alterations of the cassette. However, an even
greater variety of mutations can be introduced at any one time
by the Morinaga method because a multitude of oligonucleotides,
of various lengths, can be introduced.
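
By way of illustration only, the layout of a mutagenic oligonucleotide (wild-type flanking sequence on either side of the mutant codon) can be sketched as follows. The function name, the toy sequence and the flank length are hypothetical and are not taken from the cited references; a practical design would also consider melting temperature and secondary structure.

```python
def design_mutagenic_oligo(coding_seq, codon_index, new_codon, flank=12):
    """Return an oligonucleotide carrying new_codon between wild-type flanks.

    codon_index is 0-based and counts codons from the start of coding_seq.
    """
    coding_seq = coding_seq.upper()
    new_codon = new_codon.upper()
    if len(new_codon) != 3 or set(new_codon) - set("ACGT"):
        raise ValueError("new_codon must consist of three DNA bases")
    start = codon_index * 3
    end = start + 3
    if start < 0 or end > len(coding_seq):
        raise ValueError("codon_index lies outside the coding sequence")
    # Wild-type sequence flanking the mutation site on both sides.
    left = coding_seq[max(0, start - flank):start]
    right = coding_seq[end:end + flank]
    return left + new_codon + right


# Toy sequence for illustration; this is not a phytase gene.
toy_gene = "ATGGCTTCAGGTACCGATCTGAAACGTTGGCATTAA"
# Replace codon 5 (GAT, Asp) by GCG (Ala) with 9 flanking bases per side.
oligo = design_mutagenic_oligo(toy_gene, codon_index=5, new_codon="GCG", flank=9)
print(oligo)
```

The returned string is the sense-strand sequence; whether the sense or the antisense oligonucleotide is synthesized depends on which strand is exposed in the single-stranded gap.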

Another method of introducing mutations into DNA sequences
encoding a desired model phytase is described in Nelson and Long
(1989). It involves a 3-step generation of a PCR fragment
containing the desired mutation introduced by using a chemically
synthesized DNA strand as one of the primers in the PCR
reactions. From the PCR-generated fragment, a DNA fragment
carrying the mutation may be isolated by cleavage with
restriction endonucleases and reinserted into an expression
plasmid.

Yet another method of mutating DNA sequences encoding a
model phytase is Random Mutagenesis. Random mutagenesis is
suitably performed either as localised or region-specific random
mutagenesis in at least three parts of the gene translating to
the amino acid sequence in question, or within the whole
gene.

The random mutagenesis of a DNA sequence encoding a model
phytase may be conveniently performed by use of any method known
in the art.

In relation to the above, a further aspect of the present
invention relates to a method for generating a variant of a
model phytase, wherein the variant preferably exhibits amended
characteristics as described below, the method comprising:

(a) subjecting a DNA sequence encoding the model phytase
to Site-directed Mutagenesis, the Nelson and Long PCR
mutagenesis method, or Random Mutagenesis,

(b) expressing the mutated DNA sequence obtained in step
(a) in a host cell, and

(c) screening for host cells expressing a phytase
variant which has an altered property relative to the model
phytase.

When using Random Mutagenesis, step (a) of the above
method of the invention is preferably performed using doped
primers.

For instance, the random mutagenesis may be performed by
use of a suitable physical or chemical mutagenizing agent, by
use of a suitable oligonucleotide, or by subjecting the DNA
sequence to PCR-generated mutagenesis. Furthermore, the random
mutagenesis may be performed by use of any combination of these
mutagenizing agents. The mutagenizing agent may, e.g., be one
which induces transitions, transversions, inversions,
scrambling, deletions, and/or insertions.
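
As a small illustration of two of the mutation types named above, the following sketch classifies point substitutions between a parent and a mutated sequence of equal length as transitions (purine to purine, or pyrimidine to pyrimidine) or transversions (purine to pyrimidine, or the reverse). The function names and the toy sequences are hypothetical; insertions, deletions and rearrangements are outside the scope of this sketch.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_substitution(ref, alt):
    """Classify a single-base change as 'transition', 'transversion' or 'none'."""
    ref, alt = ref.upper(), alt.upper()
    valid = PURINES | PYRIMIDINES
    if ref not in valid or alt not in valid:
        raise ValueError("bases must be one of A, C, G, T")
    if ref == alt:
        return "none"
    same_class = {ref, alt} <= PURINES or {ref, alt} <= PYRIMIDINES
    return "transition" if same_class else "transversion"


def count_substitutions(parent, mutant):
    """Count transitions and transversions between two equal-length sequences."""
    if len(parent) != len(mutant):
        raise ValueError("sequences must have the same length")
    counts = {"transition": 0, "transversion": 0}
    for ref, alt in zip(parent, mutant):
        kind = classify_substitution(ref, alt)
        if kind != "none":
            counts[kind] += 1
    return counts


# Toy sequences for illustration only.
print(count_substitutions("ACGT", "GCTT"))
```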

Examples of a physical or chemical mutagenizing agent
suitable for the present purpose include ultraviolet (UV)
irradiation, hydroxylamine, N-methyl-N'-nitro-N-nitrosoguanidine
(MNNG), O-methyl hydroxylamine, nitrous acid, ethyl methane
sulfonate (EMS), sodium bisulphite, formic acid, and nucleotide
analogues. When such agents are used, the mutagenesis is
typically performed by incubating the DNA sequence encoding the
parent enzyme to be mutagenized in the presence of the
mutagenizing agent of choice under suitable conditions for the
mutagenesis to take place, and selecting for mutated DNA having
the desired properties.

When the mutagenesis is performed by the use of an
oligonucleotide, the oligonucleotide may be doped or spiked with
the three non-parent nucleotides during the synthesis of the
oligonucleotide at the positions which are to be changed. The
doping or spiking may be done so that codons for unwanted amino
acids are avoided. The doped or spiked oligonucleotide can be
incorporated into the DNA encoding the phytase enzyme by any
published technique, using e.g. PCR, LCR or any DNA polymerase
and ligase as deemed appropriate.

Preferably, the doping is carried out using "constant
random doping", in which the percentage of wild-type and
mutation in each position is predefined. Furthermore, the
doping may be directed toward a preference for the introduction
of certain nucleotides, and thereby a preference for the
introduction of one or more specific amino acid residues. The
doping may be made, e.g., so as to allow for the introduction
of 90% wild type and 10% mutations in each position. An
additional consideration in the choice of a doping scheme is
based on genetic as well as protein-structural constraints. The
doping scheme may be made by using the DOPE program which, inter
alia, ensures that introduction of stop codons is avoided.
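
The arithmetic behind a 90% wild type / 10% mutation scheme can be sketched as follows. This is a minimal illustration and not the DOPE program: it assumes that the non-wild-type fraction is divided equally among the three non-parent nucleotides and that positions are doped independently, neither of which is stated in the text above. The function names are hypothetical.

```python
from math import comb

BASES = "ACGT"


def doping_mix(wild_type_base, wild_type_fraction=0.90):
    """Return the nucleotide fractions to use at one doped position."""
    wild_type_base = wild_type_base.upper()
    if wild_type_base not in BASES:
        raise ValueError("wild_type_base must be one of A, C, G, T")
    if not 0.0 <= wild_type_fraction <= 1.0:
        raise ValueError("wild_type_fraction must lie between 0 and 1")
    # Assumption: the three non-parent bases share the remainder equally.
    share = (1.0 - wild_type_fraction) / 3.0
    return {b: (wild_type_fraction if b == wild_type_base else share)
            for b in BASES}


def mutation_count_distribution(n_positions, wild_type_fraction=0.90):
    """Probability of exactly k mutated positions among n doped positions."""
    p = 1.0 - wild_type_fraction
    return [comb(n_positions, k) * p ** k * (1.0 - p) ** (n_positions - k)
            for k in range(n_positions + 1)]


mix = doping_mix("G")
dist = mutation_count_distribution(30)
print(mix)
print(f"P(no mutation in 30 doped positions) = {dist[0]:.3f}")
```

Under these assumptions the number of mutated positions per oligonucleotide follows a binomial distribution, which gives a quick estimate of how many library members remain wild type.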

When PCR-generated mutagenesis is used, either a
chemically treated or non-treated gene encoding a model phytase
is subjected to PCR under conditions that increase the
mis-incorporation of nucleotides (Deshler 1992; Leung et al.,
Technique, Vol.1, 1989, pp. 11-15).

A mutator strain of E. coli (Fowler et al., Molec. Gen.
Genet., 133, 1974, pp. 179-191), S. cerevisiae or any other
microbial organism may be used for the random mutagenesis of the
DNA encoding the model phytase by, e.g., transforming a plasmid
containing the DNA encoding the model phytase into the mutator
strain, growing the mutator strain with the plasmid and
isolating the mutated plasmid from the mutator strain. The
mutated plasmid may be subsequently transformed into the
expression organism.

The DNA sequence to be mutagenized may be conveniently
present in a genomic or cDNA library prepared from an organism
expressing the model phytase. Alternatively, the DNA sequence
may be present on a suitable vector such as a plasmid or a
bacteriophage, which as such may be incubated with or otherwise
exposed to the mutagenising agent. The DNA to be mutagenized
may also be present in a host cell either by being integrated in
the genome of said cell or by being present on a vector
harboured in the cell. Finally, the DNA to be mutagenized may
be in isolated form. It will be understood that the DNA
sequence to be subjected to random mutagenesis is preferably a
cDNA or a genomic DNA sequence.

In some cases it may be convenient to amplify the mutated
DNA sequence prior to performing the expression step (b) or the
screening step (c). Such amplification may be performed in
accordance with methods known in the art, the presently
preferred method being PCR-generated amplification using
oligonucleotide primers prepared on the basis of the DNA or
amino acid sequence of the parent enzyme.

Subsequent to the incubation with or exposure to the
mutagenising agent, the mutated DNA is expressed by culturing a
suitable host cell carrying the DNA sequence under conditions
allowing expression to take place. The host cell used for this
purpose may be one which has been transformed with the mutated
DNA sequence, optionally present on a vector, or one which
carried the DNA sequence encoding the parent enzyme during the
mutagenesis treatment. Examples of suitable host cells are the
following: gram-positive bacteria such as Bacillus subtilis,
Bacillus licheniformis, Bacillus lentus, Bacillus brevis,
Bacillus stearothermophilus, Bacillus alkalophilus, Bacillus
amyloliquefaciens, Bacillus coagulans, Bacillus circulans,
Bacillus lautus, Bacillus megaterium, Bacillus thuringiensis,
Streptomyces lividans or Streptomyces murinus; and
gram-negative bacteria such as E. coli.

The mutated DNA sequence may further comprise a DNA
sequence encoding functions permitting expression of the mutated
DNA sequence.

The random mutagenesis may be advantageously localised to
a part of the model phytase in question using localized random
mutagenesis. This may, e.g., be advantageous when certain
regions of the enzyme have been identified to be of particular
importance for a given property of the enzyme, and when modified
are expected to result in a variant having improved properties.
Such regions may normally be identified when the tertiary
structure of the parent enzyme has been elucidated and related
to the function of the enzyme.

The localized, or region-specific, random mutagenesis is
conveniently performed by use of PCR-generated mutagenesis
techniques as described above or any other suitable technique
known in the art. Alternatively, the DNA sequence encoding the
part of the DNA sequence to be modified may be isolated, e.g.,
by insertion into a suitable vector, and said part may be
subsequently subjected to mutagenesis by use of any of the
mutagenesis methods discussed above.

For region-specific random mutagenesis with a view to
amending e.g. the specific activity of a model phytase, codon
positions corresponding to the following amino acid residues
from the amino acid sequences set forth in Fig. 1 may
appropriately be targeted:

Residues: 41-47, 68-80, 83-84, 115-118, 120-126, 128,
149-163, 184-185, 191-193, 198-201e, 202-203, 205, 235-236,
238-239, 242-243, 270-279, 285, 288, 332-343, 364-367, 369-375,
394.

Regions: 41-47, 68-80, 120-128, 149-163, 270-279, 332-343,
364-375.
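
For bookkeeping purposes, the "Regions" listed above can be expanded into individual alignment positions as sketched below. The helper names are hypothetical, and the sketch handles purely numeric ranges only; lettered insertion positions such as 201e, which occur in the "Residues" list, would need a numbering scheme that is not defined in this section.

```python
REGIONS = "41-47, 68-80, 120-128, 149-163, 270-279, 332-343, 364-375"


def expand_ranges(spec):
    """Expand a comma-separated list of 'a-b' ranges or single numbers."""
    positions = []
    for item in spec.split(","):
        item = item.strip()
        if not item:
            continue
        if "-" in item:
            first, last = (int(part) for part in item.split("-", 1))
        else:
            first = last = int(item)
        if last < first:
            raise ValueError(f"descending range: {item}")
        positions.extend(range(first, last + 1))
    return positions


region_positions = expand_ranges(REGIONS)
targeted = set(region_positions)

print(len(region_positions))
print(45 in targeted, 48 in targeted)
```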

The random mutagenesis may be carried out by the following
steps:

1. Select regions of interest for modification in the
parent enzyme.

2. Decide on mutation sites and non-mutated sites in
the selected region.

3. Decide on which kind of mutations should be carried
out, e.g. with respect to the desired stability and/or
performance of the variant to be constructed.

4. Select structurally reasonable mutations.

5. Adjust the residues selected by step 3 with regard
to step 4.

6. Analyse, by use of a suitable dope algorithm, the
nucleotide distribution.

7. If necessary, adjust the wanted residues to genetic
code realism, e.g. taking into account constraints resulting
from the genetic code, e.g. in order to avoid introduction of
stop codons; the skilled person will be aware that some codon
combinations cannot be used in practice and will need to be
adapted.

8. Make primers.

9. Perform random mutagenesis by use of the primers.

10. Select resulting phytase variants by screening for
the desired improved properties.
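
Steps 6 and 7 above can be illustrated with a small calculation: given a nucleotide mix at each of the three positions of a codon, the resulting amino acid distribution, including the stop codon fraction, follows directly from the standard genetic code. The sketch below is not one of the published dope algorithms; the helper names, the equal split among non-parent bases and the choice of the Trp codon TGG as an example are assumptions made for illustration.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order; '*' marks a stop codon.
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO)}


def position_mix(wild_type_base, wild_type_fraction=0.90, excluded=""):
    """Nucleotide fractions at one position; excluded bases get no share."""
    others = [b for b in BASES
              if b != wild_type_base and b not in excluded]
    if not others:
        raise ValueError("at least one non-parent base must remain")
    share = (1.0 - wild_type_fraction) / len(others)
    mix = {b: share for b in others}
    mix[wild_type_base] = wild_type_fraction
    return mix


def amino_acid_distribution(position_mixes):
    """Amino acid fractions encoded by a codon doped position by position."""
    dist = {}
    for codon in product(BASES, repeat=3):
        p = 1.0
        for base, mix in zip(codon, position_mixes):
            p *= mix.get(base, 0.0)
        if p > 0.0:
            aa = CODON_TABLE["".join(codon)]
            dist[aa] = dist.get(aa, 0.0) + p
    return dist


# Wild-type codon TGG (Trp), doped 90/10 at every position.
plain = amino_acid_distribution([position_mix(b) for b in "TGG"])
# Same codon with A excluded at positions 2 and 3, which removes
# TAA, TAG and TGA from the set of reachable codons.
adjusted = amino_acid_distribution([
    position_mix("T"),
    position_mix("G", excluded="A"),
    position_mix("G", excluded="A"),
])
print(f"stop fraction, plain doping:    {plain.get('*', 0.0):.3f}")
print(f"stop fraction, adjusted doping: {adjusted.get('*', 0.0):.3f}")
```

Excluding a base also removes the amino acids reachable only through it, which is the kind of trade-off step 7 asks the skilled person to weigh.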

Suitable dope algorithms for use in step 6 are well known
in the art. One such algorithm is described by Tomandl, D. et
al., 1997, Journal of Computer-Aided Molecular Design 11: 29-38.
Another algorithm is DOPE (Jensen, LJ, Andersen, KV, Svendsen,
A, and Kretzschmar, T (1998) Nucleic Acids Research 26:697-702).

A DNA sequence encoding a model phytase or a phytase
variant of the invention can be expressed using a recombinant
expression vector, which typically includes control sequences
encoding a promoter, operator, ribosome binding site,
translation initiation signal, and, optionally, a repressor
gene or various activator genes.

The recombinant expression vector may be any vector which
may conveniently be subjected to recombinant DNA procedures, and
the choice of vector will often depend on the host cell into
which it is to be introduced. Thus, the vector may be an
autonomously replicating vector, e.g. a plasmid, a bacteriophage
or an extra-chromosomal element. Alternatively, the vector may
be one which, when introduced into a host cell, is integrated
into the host cell genome and replicated together with the
chromosome(s) into which it has been integrated.

In the vector, the DNA sequence should be operably
connected to a suitable promoter sequence. The promoter may be
any DNA sequence which shows transcriptional activity in the
host cell of choice and may be derived from genes encoding
proteins either homologous or heterologous to the host cell. An
example of a suitable promoter for directing the transcription
of the DNA sequence encoding a phytase variant of the invention,
especially in a bacterial host, is the promoter of the lac
operon of E. coli. For transcription in a fungal host, examples
of useful promoters are those derived from the gene encoding A.
oryzae TAKA amylase.

The expression vector of the invention may also comprise a
suitable transcription terminator and, in eukaryotes,
polyadenylation sequences operably connected to the DNA sequence
encoding the phytase variant of the invention. Termination and
polyadenylation sequences may suitably be derived from the same
sources as the promoter.

The vector may further comprise a DNA sequence enabling
the vector to replicate in the host cell in question. Examples
of such sequences are the origins of replication of plasmids
pUC19, pACYC177, pUB110, pE194, pAMB1 and pIJ702.

The vector may also comprise a selectable marker, e.g. a
gene the product of which complements a defect in the host cell,
such as the dal genes from B. subtilis or B. licheniformis, or
one which confers antibiotic resistance such as ampicillin
resistance. Furthermore, the vector may comprise Aspergillus
selection markers such as amdS, argB, niaD and sC, or the
selection may be accomplished by co-transformation, e.g. as
described in WO 91/17243.

The procedures used to ligate the DNA construct of the
invention encoding a phytase variant, the promoter, terminator
and other elements, respectively, and to insert them into
suitable vectors containing the information necessary for
replication, are well known to persons skilled in the art (cf.,
for instance, Sambrook et al. (1989)).

The cell of the invention, either comprising a DNA
construct or an expression vector of the invention as defined
above, is advantageously used as a host cell in the recombinant
production of a phytase variant of the invention. The cell may
be transformed with the DNA construct of the invention encoding
the variant, conveniently by integrating the DNA construct (in
one or more copies) in the host chromosome. This integration is
generally considered to be an advantage as the DNA sequence is
more likely to be stably maintained in the cell. Integration of
the DNA constructs into the host chromosome may be performed
according to conventional methods, e.g. by homologous or
heterologous recombination. Alternatively, the cell may be
transformed with an expression vector as described above in
connection with the different types of host cells.

An isolated DNA molecule or, alternatively, a "cloned DNA
sequence," "a DNA construct," "a DNA segment" or "an isolated
DNA sequence" refers to a DNA molecule or sequence which can be
cloned in accordance with standard cloning procedures used in
genetic engineering to relocate the DNA segment from its natural
location to a different site where it will be replicated. The
term refers generally to a nucleic acid sequence which is
essentially free of other nucleic acid sequences, e.g., at least
about 20% pure, preferably at least about 40% pure, more
preferably about 60% pure, even more preferably about 80% pure,
most preferably about 90% pure, and even most preferably about
95% pure, as determined by agarose gel electrophoresis. The
cloning procedures may involve excision and isolation of a
desired nucleic acid fragment comprising the nucleic acid
sequence encoding the polypeptide, insertion of the fragment
into a vector molecule, and incorporation of the recombinant
vector into a host cell where multiple copies or clones of the
nucleic acid sequence will be replicated. The nucleic acid
sequence may be of genomic, cDNA, RNA, semisynthetic, synthetic
origin, or any combinations thereof.

The term "vector" is intended to include such
terms/objects as "nucleic acid constructs," "DNA constructs,"
"expression vectors" or "recombinant vectors."

The nucleic acid construct comprises a nucleic acid
sequence of the present invention operably linked to one or more
control sequences capable of directing the expression of the
coding sequence in a suitable host cell under conditions
compatible with the control sequences.

"Nucleic acid construct" is defined herein as a nucleic 
acid molecule, either single or double-stranded, which is 
isolated from a naturally occurring gene or which has been 
modified to contain segments of nucleic acid which are combined 
and juxtaposed in a manner which would not otherwise exist in 
nature. 

The term nucleic acid construct may be synonymous with the
term expression cassette when the nucleic acid construct
contains all the control sequences required for expression of a
coding sequence of the present invention.

The term "coding sequence" as defined herein primarily
comprises a sequence which is transcribed into mRNA and
translated into a polypeptide of the present invention when
placed under the control of the above mentioned control
sequences. The boundaries of the coding sequence are generally
determined by a translation start codon ATG at the 5'-terminus
and a translation stop codon at the 3'-terminus. A coding
sequence can include, but is not limited to, DNA, cDNA, and
recombinant nucleic acid sequences.
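
The boundary rule stated above (an ATG start codon at the 5'-terminus and an in-frame stop codon at the 3'-terminus) can be sketched as follows. This is a deliberately simplified illustration: it considers only the first ATG on the given strand and ignores introns, alternative start codons and the reverse strand. The function name and the toy sequence are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_coding_sequence(dna):
    """Return (start, end) of the first ATG-to-stop reading frame, or None.

    The slice dna[start:end] includes both the start and the stop codon.
    """
    dna = dna.upper()
    start = dna.find("ATG")
    if start == -1:
        return None
    # Walk codon by codon in the frame set by the start codon.
    for i in range(start, len(dna) - 2, 3):
        if dna[i:i + 3] in STOP_CODONS:
            return start, i + 3
    return None


# Toy sequence for illustration only.
toy = "CCATGGCTTAAGG"
bounds = find_coding_sequence(toy)
print(bounds, toy[bounds[0]:bounds[1]])
```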

The term "control sequences" is defined herein to include
all components which are necessary or advantageous for
expression of the coding sequence of the nucleic acid sequence.
Each control sequence may be native or foreign to the nucleic
acid sequence encoding the polypeptide. Such control sequences
include, but are not limited to, a leader, a polyadenylation
sequence, a propeptide sequence, a promoter, a signal sequence,
and a transcription terminator. At a minimum, the control
sequences include a promoter, and transcriptional and
translational stop signals. The control sequences may be
provided with linkers for the purpose of introducing specific
restriction sites facilitating ligation of the control sequences
with the coding region of the nucleic acid sequence encoding a
polypeptide.

A "host cell" or "recombinant host cell" encompasses any 
progeny of a parent cell which is not identical to the parent 
cell due to mutations that occur during replication. 

The cell is preferably transformed with a vector
comprising a nucleic acid sequence of the invention followed by
integration of the vector into the host chromosome.

"Transformation" means introducing a vector comprising a
nucleic acid sequence of the present invention into a host cell
so that the vector is maintained as a chromosomal integrant or
as a self-replicating extra-chromosomal vector. Integration is
generally considered to be an advantage as the nucleic acid
sequence is more likely to be stably maintained in the cell.
Integration of the vector into the host chromosome may occur by
homologous or non-homologous recombination as described above.

The host cell may be a unicellular microorganism, e.g., a
prokaryote, or a non-unicellular microorganism, e.g., a
eukaryote. Examples of a eukaryotic cell are a mammalian cell,
an insect cell, a plant cell or a fungal cell. Useful mammalian
cells include Chinese hamster ovary (CHO) cells, HeLa cells,
baby hamster kidney (BHK) cells, COS cells, or any number of
other immortalized cell lines available, e.g., from the American
Type Culture Collection.

In a preferred embodiment, the host cell is a fungal cell. 
Fungal cells may be transformed by a process involving 
15 protoplast formation, transformation of the protoplasts, and 
regeneration of the cell wall in a manner known per se. 

The present invention also relates to a transgenic plant,
plant part, such as a plant seed, or plant cell, which has been
transformed with a DNA sequence encoding the phytase of the
invention so as to express or produce this enzyme. Also
compositions and uses of such plant or plant part are within the
scope of the invention, especially its use as feed and food or
additives therefor, along the lines of the present use and
food/feed claims.

The transgenic plant can be dicotyledonous or
monocotyledonous, for short a dicot or a monocot. Of primary
interest are such plants which are potential food or feed
components and which comprise phytic acid. A normal phytic acid
level of feed components is 0.1-100 g/kg, or more usually 0.5-50
g/kg, most usually 0.5-20 g/kg. Examples of monocot plants are
grasses, such as meadow grass (blue grass, Poa), forage grass
such as festuca, lolium, temperate grass, such as Agrostis, and
cereals, e.g. wheat, oats, rye, barley, rice, sorghum and maize
(corn).

Examples of dicot plants are legumes, such as lupins, pea,
bean and soybean, and cruciferous plants (family Brassicaceae),
such as cauliflower, oil seed rape and the closely related model
organism Arabidopsis thaliana.

Such transgenic plant etc. is capable of degrading its own
phytic acid, and accordingly the need for adding such enzymes to
food or feed comprising such plants is alleviated. Preferably,
the plant or plant part, e.g. the seeds, are ground or milled,
and possibly also soaked before being added to the food or feed
or before the use, e.g. intake, thereof, with a view to adapting
the speed of the enzymatic degradation to the actual use.

If desired, the plant produced enzyme can also be
recovered from the plant. In certain cases the recovery from the
plant is to be preferred with a view to securing a heat stable
formulation in a potential subsequent pelleting process.

Examples of plant parts are stem, callus, leaves, root,
fruits, seeds, tubers etc. Any plant tissue is also included
in this definition.

Any plant cell, whatever the tissue origin, is included in 
the definition of plant cells above. 

Also included within the scope of the invention are the
progeny of such plants, plant parts and plant cells.

The skilled man will know how to construct a DNA
expression construct for insertion into the plant in question,
paying regard i.a. to whether the enzyme should be excreted in a
tissue specific way. Of relevance for this evaluation is the
stability (pH-stability, degradability by endogenous proteases
etc.) of the phytase in the expression compartments of the
plant. He will also be able to select appropriate regulatory
sequences such as promoter and terminator sequences, and signal
or transit sequences if required (Tague et al, Plant Phys., 86,
506, 1988).

The plant, plant part etc. can be transformed with this
DNA construct using any known method. An example of such method
is the transformation by a viral or bacterial vector such as
bacterial species of the genus Agrobacterium genetically
engineered to comprise the gene encoding the phytase of the
invention. Also methods of directly introducing the phytase DNA
into the plant cell or plant tissue are known in the art, e.g.
microinjection and electroporation (Gasser et al, Science, 244,
1293; Potrykus, Bio/Techn. 8, 535, 1990; Shimamoto et al,
Nature, 338, 274, 1989).

Following the transformation, the transformants are
screened using any method known to the skilled man, following
which they are regenerated into whole plants.

These plants etc. as well as their progeny then carry the 
phytase encoding DNA as a part of their genetic equipment. 

In general, reference is made to WO 9114782A and WO
9114772A.

Agrobacterium tumefaciens mediated gene transfer is the
method of choice for generating transgenic dicots (for review
Hooykaas & Schilperoort, 1992. Plant Mol. Biol. 19: 15-38),
however it can also be used for transforming monocots. Due to
host range limitations it is generally not possible to transform
monocots with the help of A. tumefaciens. Here, other methods
have to be employed. The method of choice for generating
transgenic monocots is particle bombardment (microscopic gold or
tungsten particles coated with the transforming DNA) of
embryonic calli or developing embryos (Christou, 1992. Plant J.
2: 275-281; Shimamoto, 1994. Curr. Opin. Biotechnol. 5: 158-162;
Vasil et al., 1992. Bio/Technology 10: 667-674).

Also other systems for the delivery of free DNA into these
plants, including viral vectors (Joshi & Joshi, 1991. FEBS Lett.
281: 1-8), protoplast transformation via polyethylene glycol or
electroporation (for review see Potrykus, 1991. Annu. Rev. Plant
Physiol. Plant Mol. Biol. 42: 205-225), microinjection of DNA
into mesophyll protoplasts (Crossway et al., 1986. Mol. Gen.
Genet. 202: 79-85), and macroinjection of DNA into young floral
tillers of cereal plants (de la Pena et al., 1987. Nature 325:
274-276) are preferred methods.

In general, the cDNA or gene encoding the phytase variant
of the invention is placed in an expression cassette (e.g.
Pietrzak et al., 1986. Nucleic Acids Res. 14: 5857-5868)
consisting of a suitable promoter active in the target plant and
a suitable terminator (termination of transcription). This
cassette (of course including a suitable selection marker, see
below) will be transformed into the plant as such in case of
monocots via particle bombardment. In case of dicots the
expression cassette is placed first into a suitable vector
providing the T-DNA borders and a suitable selection marker
which in turn are transformed into Agrobacterium tumefaciens.
Dicots will be transformed via the Agrobacterium harbouring the
expression cassette and selection marker flanked by T-DNA
following standard protocols (e.g. Akama et al., 1992. Plant
Cell Reports 12: 7-11). The transfer of T-DNA from Agrobacterium
to the plant cell has been recently reviewed (Zupan & Zambryski,
1995. Plant Physiol. 107: 1041-1047). Vectors for plant
transformation via Agrobacterium are commercially available or
can be obtained from many labs that construct such vectors (e.g.
Deblaere et al., 1985. Nucleic Acids Res. 13: 4777-4788; for
review see Klee et al., 1987. Annu. Rev. Plant Physiol. 38:
467-486).

Available plant promoters: Depending on the process under
manipulation, organ- and/or cell-specific expression as well as
appropriate developmental and environmental control may be
required. For instance, it is desirable to express a phytase
cDNA in maize endosperm etc. The most commonly used promoter has
been the constitutive 35S-CaMV promoter (Franck et al., 1980.
Cell 21: 285-294). Expression will be more or less equal
throughout the whole plant. This promoter has been used
successfully to engineer herbicide- and pathogen-resistant
plants (for review see Stitt & Sonnewald, 1995. Annu. Rev. Plant
Physiol. Plant Mol. Biol. 46: 341-368). Organ-specific promoters
have been reported for storage sink tissues such as seeds,
potato tubers, and fruits (Edwards & Coruzzi, 1990. Annu. Rev.
Genet. 24: 275-303), and for metabolic sink tissues such as
meristems (Ito et al., 1994. Plant Mol. Biol. 24: 863-878).

The medium used to culture the transformed host cells may
be any conventional medium suitable for growing the host cells
in question. The expressed phytase may conveniently be secreted
into the culture medium and may be recovered therefrom by
well-known procedures including separating the cells from the
medium by centrifugation or filtration, precipitating
proteinaceous components of the medium by means of a salt such
as ammonium sulphate, followed by chromatographic procedures
such as ion exchange chromatography, affinity chromatography,
or the like.

Preferred host cells are a strain of Fusarium, Hansenula,
Trichoderma or Aspergillus, in particular a strain of Fusarium
graminearum, Fusarium venenatum, Fusarium cerealis, Fusarium sp.
having the identifying characteristic of Fusarium ATCC 20334, as
further described in PCT/US/95/07743, Hansenula polymorpha,
Trichoderma harzianum or Trichoderma reesei, Aspergillus niger
or Aspergillus oryzae.

References for expression in Hansenula polymorpha:
Gellissen, G., Piontek, M., Dahlems, U., Jenzelewski, V.,
Gavagan, J.E., DiCosimo, R., Anton, D.I. & Janowicz, Z.A. (1996)
Recombinant Hansenula polymorpha as a biocatalyst: coexpression
of the spinach glycolate oxidase (GO) and the S. cerevisiae
catalase T (CTT1) gene. Appl. Microbiol. Biotechnol. 46, 46-54.

Some more specific uses of the phytase variants according
to the invention appear from PCT/DK97/00568, the last pages of
the detailed description of the invention section.

In a preferred embodiment, the phytase variant of the 
invention is essentially free of other non-phytase polypeptides, 
e.g., at least about 20% pure, preferably at least about 40% 
15 pure, more preferably about 60% pure, even more preferably about 
80% pure, most preferably about 90% pure, and even most 
preferably about 95% pure, as determined by SDS-PAGE. Sometimes 
such polypeptide is alternatively referred to as a "purified" 
and/or "isolated" phytase. 

A phytase polypeptide which comprises a phytase variant of
the invention includes fused polypeptides or cleavable fusion
polypeptides in which another polypeptide is fused at the
N-terminus or the C-terminus of the polypeptide or fragment
thereof. A fused polypeptide is produced by fusing a nucleic
acid sequence (or a portion thereof) encoding another
polypeptide to a nucleic acid sequence (or a portion thereof)
encoding a phytase variant of the present invention. Techniques
for producing fusion polypeptides are known in the art, and
include ligating the coding sequences encoding the polypeptides
so that they are in frame and that expression of the fused
polypeptide is under control of the same promoter(s) and
terminator.

A "feed" and a "food", respectively, means any natural or
artificial diet, meal or the like or components of such meals
intended or suitable for being eaten, taken in, digested, by an
animal and a human being, respectively.

The phytase variant of the invention may exert its effect
in vitro or in vivo, i.e. before intake or in the stomach of the
individual, respectively. Also a combined action is possible.

A phytase composition according to the invention always
comprises at least one phytase of the invention.

Generally, phytase compositions are liquid or dry. 
Liquid compositions need not contain anything more than
the phytase enzyme, preferably in a highly purified form.
Usually, however, a stabilizer such as glycerol, sorbitol or
monopropylene glycol is also added. The liquid composition may
also comprise other additives, such as salts, sugars,
preservatives, pH-adjusting agents, proteins, phytate (a phytase
substrate). Typical liquid compositions are aqueous or oil-based
slurries. The liquid compositions can be added to a food or feed
after an optional pelleting thereof.

Dry compositions may be spray-dried compositions, in which
case the composition need not contain anything more than the
enzyme in a dry form. Usually, however, dry compositions are
so-called granulates which may readily be mixed with e.g. food or
feed components, or more preferably, form a component of a
pre-mix. The particle size of the enzyme granulates preferably is
compatible with that of the other components of the mixture.
This provides a safe and convenient means of incorporating
enzymes into e.g. animal feed.

Agglomeration granulates are prepared using agglomeration
technique in a high shear mixer (e.g. Lödige) during which a
filler material and the enzyme are co-agglomerated to form
granules. Absorption granulates are prepared by having cores of
a carrier material absorb or be coated by the enzyme.

Typical filler materials are salts such as disodium
sulphate. Other fillers are kaolin, talc, magnesium aluminium
silicate and cellulose fibres. Optionally, binders such as
dextrins are also included in agglomeration granulates.

Typical carrier materials are starch, e.g. in the form of
cassava, corn, potato, rice and wheat. Salts may also be used.

Optionally, the granulates are coated with a coating
mixture. Such mixture comprises coating agents, preferably
hydrophobic coating agents, such as hydrogenated palm oil and
beef tallow, and if desired other additives, such as calcium
carbonate or kaolin.

Additionally, phytase compositions may contain other
constituents such as colouring agents, aroma compounds,
stabilizers, vitamins, minerals, other feed or food enhancing
enzymes, i.e. enzymes that enhance the nutritional properties
of feed/food, etc. This is so in particular for the so-called
pre-mixes.

A "food or feed additive" is an essentially pure compound
or a multi component composition intended for or suitable for
being added to food or feed. In particular it is a substance
which by its intended use is becoming a component of a food or
feed product or affects any characteristics of a food or feed
product. It is composed as indicated for phytase compositions
above. A typical additive usually comprises one or more
compounds such as vitamins, minerals or feed enhancing enzymes
and suitable carriers and/or excipients.

In a preferred embodiment, the phytase compositions of the
invention additionally comprise an effective amount of one or
more feed enhancing enzymes, in particular feed enhancing
enzymes selected from the group consisting of α-galactosidases,
β-galactosidases, in particular lactases, other phytases,
β-glucanases, in particular endo-β-1,4-glucanases and
endo-β-1,3(4)-glucanases, cellulases, xylosidases, galactanases, in
particular arabinogalactan endo-1,4-β-galactosidases and
arabinogalactan endo-1,3-β-galactosidases, endoglucanases, in
particular endo-1,2-β-glucanase, endo-1,3-α-glucanase, and
endo-1,3-β-glucanase, pectin degrading enzymes, in particular
pectinases, pectinesterases, pectin lyases, polygalacturonases,
arabinanases, rhamnogalacturonases, rhamnogalacturonan acetyl
esterases, rhamnogalacturonan-α-rhamnosidase, pectate lyases,
and α-galacturonisidases, mannanases, β-mannosidases, mannan
acetyl esterases, xylan acetyl esterases, proteases, xylanases,
arabinoxylanases and lipolytic enzymes such as lipases,
phospholipases and cutinases.

The animal feed additive of the invention is supplemented
to the mono-gastric animal before or simultaneously with the
diet. Preferably, the animal feed additive of the invention is
supplemented to the mono-gastric animal simultaneously with the
diet. In a more preferred embodiment, the animal feed additive
is added to the diet in the form of a granulate or a stabilized
liquid.

An effective amount of phytase in food or feed is from
about 10 to 20,000; preferably from about 10 to 15,000, more
preferably from about 10 to 10,000, in particular from about 100
to 5,000, especially from about 100 to about 2,000 FYT/kg feed
or food.

Examples of other specific uses of the phytase of the
invention are in soy processing and in the manufacture of
inositol or derivatives thereof.

The invention also relates to a method for reducing
phytate levels in animal manure, wherein the animal is fed a
feed comprising an effective amount of the phytase of the
invention.

Also comprised in this invention is the use of a phytase
of the invention during the preparation of food or feed
preparations or additives, i.e. the phytase exerts its phytase
activity during the manufacture only and is not active in the
final food or feed product. This aspect is relevant for instance
in dough making and baking.

The invention relates to a phytase variant which, when
aligned according to Fig. 1, is amended as compared to a model
phytase in at least one of the following positions, using the
position numbering corresponding to P_lycii:

24; 27; 31; 33; 39; 40; 41; 42; 43; 44; 45; 46; 47; 49; 51; 56;
58; 59; 61; 62; 68; 69; 70; 71; 72; 73; 74; 75; 76; 77; 78; 79;
80; 81; 82; 83; 84; 88; 90; 102; 115; 116; 117; 118; 119; 120;
121; 122; 123; 124; 125; 126; 127; 128; 132; 143; 148; 149; 150;
151; 152; 153; 154; 155; 156; 157; 158; 159; 160; 161; 162; 163;
170f; 170g; 171; 172; 173; 184; 185; 186; 187; 187a; 190; 191;
192; 193; 194; 195; 198; 199; 200; 201; 201a; 201b; 201c; 201d;
201e; 201f; 202; 203; 203a; 204; 205; 211; 215; 220; 223; 228;
232; 233; 234; 235; 236; 237; 238; 239; 242; 243; 244; 246;
251e; 253; 256; 260; 264; 265; 267; 270; 271; 272; 273; 274;
275; 276; 277; 278; 279; 280; 283; 285; 287; 288; 292; 293; 302;
304; 332; 333; 334; 335; 336; 337; 338; 339; 340; 341; 342; 343;
348; 349; 352; 360; 362; 364; 365; 366; 367; 368; 369; 370; 371;
372; 373; 374; 375; 376; 383k; 387; 393; 394; 396; 404; 409;
411; 412; 413; 417; 421; 431.

From these variants we expect amended characteristics,
preferably amended activity characteristics. In fact, for
several variants such amended characteristics have already been
shown (see the experimental part). As above, "amended" means
as compared to the model phytase. "Amended activity
characteristics" means amended in at least one phytase activity
related respect, such as (non-exclusive list): pH stability,
temperature stability, pH profile, temperature profile, specific
activity (in particular in relation to pH and temperature),
substrate specificity, substrate cleavage pattern, substrate
binding, position specificity, the velocity and level of release
of phosphate from corn (reaction rate, phytate degradation
rate), end level of released phosphate reached.

Preferred amended activity characteristics are amended
specific activity, preferably increased, and preferably
increased at a pH of 3, 4, 5, or 6; amended pH or temperature
profile; and/or amended, preferably increased, thermostability,
e.g. of an increased melting temperature as measured using DSC.

Preferred phytase variants are: Phytase variants which,
when aligned according to Fig. 1, are amended as compared to a
model phytase in at least one of the following positions, using
the position numbering corresponding to P_lycii:
43; 44; 47; 51; 58; 62; 78; 80; 83; 88; 90; 102; 143; 148; 153;
154; 186; 187a; 195; 198; 201e; 204; 205; 211; 215; 220; 242;
244; 251e; 260; 264; 265; 267; 270; 273; 278; 302; 336; 337;
339; 352; 365; 373; 383k; 404; 417.

The following variants of A_fumigatus constitute a
subgroup: Q43L; Q270L; G273D,K; N336S; A205E; Y278H; Q43L+Q270L;
Q43L+Q270L+G273D; Q43L+Q270L+G273D+N336S; G273K+A205E;
G273K+A205E+Y278H (see EP 0897010).
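
The variant shorthand used here (original residue, alignment position, one or more replacement residues, with "+" joining the members of a multiple variant) lends itself to mechanical parsing. The following is only a minimal sketch in Python; the names `Substitution` and `parse_variant` and the regular expression are illustrative assumptions and not part of the disclosure, and the sketch does not handle the "()" insertion/deletion notation.

```python
import re
from typing import NamedTuple


class Substitution(NamedTuple):
    original: str         # one-letter code of the residue in the model phytase
    position: str         # alignment position label, e.g. "43" or "201e"
    replacements: tuple   # one-letter codes of the alternative residues


# Hypothetical pattern: original residue, position with an optional
# lower-case insertion letter, then comma-separated replacement residues.
_TOKEN = re.compile(r"^([A-Z])(\d+[a-z]?)([A-Z](?:,[A-Z])*)$")


def parse_variant(text):
    """Parse e.g. 'Q43L+Q270L+G273D' into a list of Substitution tuples."""
    result = []
    for token in text.replace(" ", "").split("+"):
        match = _TOKEN.match(token)
        if match is None:
            raise ValueError(f"unrecognised variant token: {token!r}")
        original, position, repl = match.groups()
        result.append(Substitution(original, position, tuple(repl.split(","))))
    return result


print(parse_variant("Q43L+Q270L+G273D"))
print(parse_variant("G273D,K"))
```

Keeping the position as a string rather than an integer is a deliberate choice in this sketch, since labels such as 201e or 383k carry a letter suffix.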

Generally, variants of the invention can be deduced or
identified as follows: Looking at the alignment according to
Fig. 1, comparing two sequences, one of which is a model phytase
with improved properties, identifying amino acid differences in
relevant positions/areas, and transferring (substituting with)
from the model to the other phytase sequence the amino acid in a
relevant position.
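
The deduction just described (compare two aligned sequences, find differences at relevant positions, transfer the model residue) can be sketched as follows. This is a hedged illustration only: the function name `deduce_variants`, the column labels and the toy sequences are hypothetical and are not taken from Fig. 1; a gap in either row is written "()" by analogy with the notation used in the variant lists of this description.

```python
def deduce_variants(labels, model, target, relevant):
    """Return substitutions that transfer model residues into the target.

    labels   -- alignment column labels (e.g. "43", "201e"), one per column
    model    -- aligned model sequence carrying the desired property
    target   -- aligned sequence of the phytase to be amended
    relevant -- set of column labels considered relevant
    """
    if not (len(labels) == len(model) == len(target)):
        raise ValueError("alignment rows must have equal length")
    variants = []
    for label, m, t in zip(labels, model, target):
        if label not in relevant or m == t:
            continue  # skip irrelevant columns and identical residues
        old = "()" if t == "-" else t   # gap in the target: an insertion
        new = "()" if m == "-" else m   # gap in the model: a deletion
        variants.append(f"{old}{label}{new}")
    return variants


# Toy alignment (hypothetical data, not taken from Fig. 1)
labels = ["41", "42", "43", "44", "45"]
model = "GSLYS"
target = "GGQY-"
print(deduce_variants(labels, model, target, {"42", "43", "45"}))
```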

The invention also relates to a process for preparing a
phytase variant which includes the above method, and further
includes the deducement and synthesis of the corresponding DNA
sequence, the transformation of a host cell, the cultivation of
the host cell and the recovery of the phytase variant.

Relevant positions/areas include those mentioned below in
relation to important phytase activity characteristics such as
specific activity, thermostability, pH activity/stability.

The present invention also relates to phytase variants
(varied according to a model phytase as defined herein) which
are obtainable, preferably obtained, by the process outlined
above and which are expected to exhibit an amended
characteristic/property, and preferably do exhibit such amended
characteristic, e.g. an improved specific activity.

At least the basidiomycete model phytases P_lycii and
T_pubescens exhibit a high specific activity (as determined
using the method of Example 2 herein).

This is an example of a desired property which can be
transferred to other phytases, e.g. the other phytases listed in
Fig. 1, in particular to the A_pediades and the ascomycete
phytases such as A_fumigatus, A_ficuum, consphyA, by a
deducement process such as the one mentioned above.

Thus, amended specific activity, in particular an improved
specific activity, in particular at low pH and/or high
temperature, is expected from variants which have been amended
in relevant areas, viz. (i) in the amino acid residues which
point into the active site cleft; or (ii) in the amino acid
residues in the close neighbourhood of these active site
residues. Preferably, close neighbourhood means within 10 Å from
the active site residues.

From the pdb file 1IHP (Brookhaven Database entry of
18.03.98 re 1IHP, Structure of Phosphomonoesterase, D. Kostrewa;
or as published in Nature Structural Biology, 4, 1997, p. 185-190),
active site regions can be identified, using the program
INSIGHTII from Molecular Simulations MSI, San Diego, California,
and using the subset command, an "active site shell" can be
defined comprising those amino acid residues which lie close to
the catalytic residues, defined as H59, D339 and R58 in A.
ficuum phytase (corresponding to Peniophora numbers H71, D335
and R70, respectively). An "active site shell (10 Å)" comprises
those residues which lie within 10 Å from the above catalytic
residues.

The residues within 10 Å from H71 and D335 are the
following (using Peniophora numbers): 41-47, 68-77, 115-118,
120-126, 128, 149-163, 185, 191-193, 199, 243, 270-271, 273-275,
277-279, 288, 332-343, 364-367, 369-375, 394 ("the active site
shell (10 Å)").
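
The shell definition above is a plain distance criterion, which can be illustrated outside the modelling program. The sketch below is an assumption-laden stand-in for the subset command, not a description of it: the function `residues_within` is hypothetical, the coordinates are invented toy values rather than data from the structure file, and a residue is counted as soon as any one of its atoms lies within the cutoff of any atom of a centre residue.

```python
import math


def residues_within(atoms, centres, cutoff):
    """Return sorted residue numbers having any atom within `cutoff`
    Angstrom of any atom belonging to a residue in `centres`.

    atoms   -- iterable of (residue_number, (x, y, z)) tuples
    centres -- set of residue numbers treated as catalytic residues
    """
    atoms = list(atoms)
    centre_coords = [xyz for res, xyz in atoms if res in centres]
    shell = set()
    for res, xyz in atoms:
        if any(math.dist(xyz, c) <= cutoff for c in centre_coords):
            shell.add(res)
    return sorted(shell)


# Toy coordinates (hypothetical, not taken from the structure file)
atoms = [
    (71, (0.0, 0.0, 0.0)),
    (72, (3.0, 0.0, 0.0)),
    (120, (0.0, 9.5, 0.0)),
    (200, (0.0, 0.0, 14.0)),
]
print(residues_within(atoms, {71}, 10.0))
print(residues_within(atoms, {71}, 5.0))
```

Lowering the cutoff from 10 to 5 in the second call shrinks the shell, which mirrors the narrower shells obtained when a shorter distance is chosen.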

Preferably, a "substrate binding shell" can also be 
defined which comprises those residues which are in close 
proximity to the substrate binding site and which can therefore 
be expected to be in contact with the substrate. 

This information can be deduced as described above, by
docking a sugar analogue to phytin into the active site cleft
(the residues making up the surface of the active site). If a
sugar without any phosphate groups is docked into the active
site cleft, e.g. alpha-D-glucose (chair conformation, structure
provided by the INSIGHTII program), using a fixed distance as
shown below, the residues pointing towards the active site cleft
can be extracted using the subset command and using a distance
of 10 Å from the substrate analogue. Alternatively, the compound
inositol-1,4,5-triphosphate (Brookhaven database file 1djx,
Inositol-1,4,5-triphosphate) can be docked into the active site
cleft. This compound and glucose, however, are more or less
superimposable.

The distances in Ångström (Å) are: From oxygen atom in
position 6 of the alpha-D-glucose to
atom ND1 of H59: 5.34
atom NH2 of R58: 6.77
atom NH2 of R142: 5.09
atom ND2 of N340: 3.00
atom ND1 of H59: 7.76
atom NH2 of R58: 8.58
(the Peniophora numbers of the above residues are: H71,
R70, R155, N336, H71 and R70, respectively).

In this way, the residues in contact with the substrate
are identified as follows (Peniophora numbers): 43-44; 70-80;
83-84; 115; 153; 155-156; 184; 191-192; 198-202; 205; 235;
238-242; 270; 272-273; 275-277; 332-336; 338; 369; 371 ("the
substrate binding shell (10 Å)").

Variants being amended in one or more of (1) the active
site shell or (2) the substrate binding shell, are strongly
expected to have an amended specific activity. This leads to the
following joint grouping of positions (still Peniophora numbers
and 10 Å shells): 41-47, 68-80, 83-84, 115-118, 120-126, 128,
149-163, 184-185, 191-193, 198-201e, 202-203, 205, 235-236,
238-239, 242-243, 270-279, 285, 288, 332-343, 364-367, 369-375, 394.
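
Combining two shells written in range notation amounts to a set union followed by re-compression into ranges. The following is a minimal sketch under stated assumptions: the helper names `expand` and `compress` are hypothetical, only short excerpts of the two shells are used as input, and purely numeric positions are handled (labels with a letter suffix, such as 201e, would need separate treatment).

```python
def expand(ranges):
    """Expand a string such as '41-47, 128' into a set of integers."""
    positions = set()
    for part in ranges.replace(";", ",").split(","):
        part = part.strip()
        if not part:
            continue
        if "-" in part:
            start, end = part.split("-")
            positions.update(range(int(start), int(end) + 1))
        else:
            positions.add(int(part))
    return positions


def compress(positions):
    """Collapse a set of integers back into 'a-b' range notation."""
    runs, run = [], []
    for p in sorted(positions):
        if run and p == run[-1] + 1:
            run.append(p)          # extend the current consecutive run
        else:
            if run:
                runs.append(run)
            run = [p]              # start a new run
    if run:
        runs.append(run)
    return ", ".join(
        str(r[0]) if len(r) == 1 else f"{r[0]}-{r[-1]}" for r in runs
    )


active_site = expand("41-47, 68-77, 115-118")    # excerpt only
substrate = expand("43-44; 70-80; 83-84; 115")   # excerpt only
print(compress(active_site | substrate))
```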

Preferably, the active site shell and the substrate
binding shell are defined as described above using the
basidiomycete model phytases of Fig. 1, the Peniophora phytase
being a preferred model. A deducement of corresponding variants
of other model phytases is possible using the alignment of Fig.
1.

In a preferred embodiment, a distance of 5 Å is used in the
subset command, thus defining active site and substrate binding
shells of a more limited size, e.g. an active site shell
comprising the residues 43-44, 69-74, 117, 125, 155-156, 159,
274, 332-340, 370-374 (5 Å from H71 and D335), "active site
shell (5 Å)".

Generally the active site shell and substrate binding
shell regions form the basis for selecting random mutagenesis
regions. Examples of preferred random mutagenesis regions are

regions 69-74, 332-340, 370-374, doping to be added (a 5 Å
approach); and

regions 57-62, 142-146, 337-343, doping to be added (a 10 Å
approach).

It is presently contemplated that any amendment in either
of these positions will lead to a phytase of amended
characteristics, e.g. of an amended specific activity.

The above expression "any amendment in either of the
positions" is considered fully equivalent to listing each
position and each substitution, e.g. as follows for the above
sub-group 41-47:

41A,C,D,E,F,G,H,I,K,L,M,N,P,Q,R,S,T,V,W,Y;
42A,C,D,E,F,G,H,I,K,L,M,N,P,Q,R,S,T,V,W,Y;
43A,C,D,E,F,G,H,I,K,L,M,N,P,Q,R,S,T,V,W,Y;




44A,C,D,E,F,G,H,I,K,L,M,N,P,Q,R,S,T,V,W,Y;
45A,C,D,E,F,G,H,I,K,L,M,N,P,Q,R,S,T,V,W,Y;
46A,C,D,E,F,G,H,I,K,L,M,N,P,Q,R,S,T,V,W,Y;
47A,C,D,E,F,G,H,I,K,L,M,N,P,Q,R,S,T,V,W,Y.
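
The equivalence between a position range and the explicit listing can be made concrete with a few lines of code. This is merely an illustrative sketch; the names `AMINO_ACIDS` and `expand_positions` are assumptions introduced here, and the residue alphabet is the standard set of 20 one-letter codes used in the listing above.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard one-letter codes


def expand_positions(first, last):
    """List every position in first..last with all 20 residue options."""
    options = ",".join(AMINO_ACIDS)
    return [f"{pos}{options}" for pos in range(first, last + 1)]


for entry in expand_positions(41, 47):
    print(entry + ";")
```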
In a preferred embodiment, amended specific activity is
expected from the following variants:

42S,G; 43A,C,D,E,F,G,H,I,K,L,M,N,P,Q,R,S,T,V,W,Y; 45D,S;
47Y,F; 51E,A; 75W,F; 78S,D; 79G; 80K,A; 83I,Q; 84Q,V; 116S;
118V,L; 119E; 120L; 122A; 123N,T; 125S; 126H,S; 127Q,E; 128A,T;
151A,S; 152G; 153D,Y; 154Q,D,G; 157V; 158D,A; 159T; 160A,S;
161T,N; 162N; 163W; 184Q,S; 186A,E; 198A,N; 200G,V; 201D;
deletions of one or more of 201a, 201b, 201c, 201d, 201e, 201f -
preferably all; 202S; 205Q,E; 235Y,L; 238L,M; 242P; 270Y,A,L;
271D; 273D,K; 275F,Y; 278T,H; 332F; 336S; 337T,Q; 339V; 340P,A;
343A,S; 364W,F; 365V,L; 366D,V; 367K; 368K; 369I,L; 370V; 373S;
374A; 375H; 376M; 393V.

Particularly preferred variants are the following: 78S;
79G; 80A; 83I,Q; 84Q,V; 198A,N; 200G,V; 201D; deletions in one
or more of 201a, 201b, 201c, 201d, 201e, 201f - preferably all
deletions; 202S; 205Q,E; 235Y,L; 238L,M; 242P; 273D; 275F,Y.

Other particularly preferred variants are the following:
43A,C,D,E,F,G,H,I,K,L,M,N,P,Q,R,S,T,V,W,Y; in particular 43M,P;
75W,F; 80K; 153D; 184Q,S; 270Y,A; 332F; 369I,L.

The following variants are especially preferred:
43L,G,N,V,A,I,T; 78D; 153Y; 154G; 270L; 273D,K. Double and
triple variants (43L/270L); (43L/270L/273D); (43L/78D) and
(43L/153Y/154G) are also especially preferred. Other preferred
variants are 205E; 278H; 336S.

These especially preferred single, double and triple
variants are preferably variants of model phytases which can be
aligned to Fig. 1, in particular variants of the specific model
phytases listed in Fig. 1.

At least consphyA is known to have a high thermostability.
Still further, the thermostability of P_lycii is rather high.

This is an example of a desired property which can be
transferred to other phytases, e.g. the other phytases listed in
Fig. 1, in particular to the basidiomycete phytases such as
P_lycii and A_pediades, by a deducement process such as the one
mentioned above.

Amended thermostability, in particular improved
thermostability, is expected on this background from the
following variants:

39H,S; 40L,N; 43P; 47Y,F; 49P; 51E,A; 56P; 58D; 61R; 62V;
80K; 83A; 84Y; 172P; 184P; 195T; 198A; 204V; 211L; 223D; 236Y;
242P; 246V; 253P; 264R; 265Q; 280A,P; 283P; 287A; 292F,Y; 293A;
302R; 304P; 337S; 348Y; 387P; 396R; 409R; 411K; 412R; 417E;
421F,Y.

The following variants of amended thermostability are
particularly preferred: 39S; 40N; 47Y,F; 51A; 83A; 195T; 204V;
211L; 242P; 265A.

Further variants of amended thermostability are the
following: 42G; 43T,L,G; 44N; 58K,A; 59G; 62I; 69Q; 75F; 78D;
79G; 80A; 81A,G; 82T; 83K,R; 84I; 88I; 90R,A; 102Y; 115N; 118V;
122A; 123Q,N; 125M,S; 126V,S; 127N,Q; 128S,A; 143N,K; 148V,I;
154S; 158D; 170fH; 170gA; 171T,N; 172N; 173W; 184S; 186A; 187A;
187aS; 193S; 195V,L; 198V; 201E; 201eT; 202A; 203aT; 204A; 211V;
215P,A; 220L,N; 223H; 228N; 232T; 233E; 235T; 236N; 242S; 244D;
251eQ,E; 256D; 264I; 260A,H; 265A; 267D; 270G; 271D; 273K,D;
278T,H; 287T; 293V; 302H; 337T,G; 338I; 339V,I; 340A; 352K;
365A,S; 366S; 367A; 369L; 373S,A; 374S; 376M; 383kE,Q; 404G,A;
411T; 417R; 431E.

Other concepts of the invention, which can be expected to
impart an improved thermostability to a phytase, are as follows
- considering the 1IHP structure previously referred to and
transferring via an alignment according to Fig. 1 as outlined
herein:

(A) Introduction of proline residues in spatial positions where
the special dihedral angles of proline are satisfied, the
hydrogen bonding network is not hampered and no steric clashes
are observed.

(B) Filling up holes: By substitution for bigger residues in
internal cavities an improvement in stability can often be
obtained.

(C) Cystine bridge: Cystine bridges will often make the proteins
more rigid and increase the energy of unfolding.

Further variants from which amended thermostability is
expected according to these concepts of (A) to (C) are: 27P,
31Y, 132F, 132I, 132L, 184P, 186P, 190P, 280P, 343F, 343I, 343L,
349P, 362P and (33C and 24C).

Concept (A): 27P, 184P, 186P, 190P, 349P, 362P.
Concept (B): 343F,I,L; 31Y; 132F,I,L; 273F.
Concept (C): 33C/24C.

Amended pH activity or stability, preferably stability, in
particular at low pH, in particular improved, is another desired
property which can be transferred by aligning according to Fig.
1 and transferring from models of improved pH profiles to other
phytases - as outlined above.

Other concepts of the invention, which can be expected to
impart an improved stability at low pH to a phytase, are as
follows - considering the 1IHP structure previously referred to
and transferring via an alignment according to Fig. 1 as
outlined herein:

(D) Surface charges: Better distribution at low pH, to avoid
clusters of negative or positive charges, and to avoid residues
of the same charge lying too close together.

(E) Prevent deamidation: Surface exposed Q or N in close
contact with negatively charged residues.

Phytase variants having improved pH stability/activity at
low pH are expected to be: 39H; 39Q; 80A; 203R; 271N; 51R; 154S;
185S; 194S; 194T; 288L; 288I; 288F; 360R; 173Q,S; 204Q,S;
303K,S; 81Q,E.

Concept (D): 203R, 271N, 51R, 185S, 360R; 173Q,S; 204Q,S;
303K,S; 81Q,E.

Concept (E): 154S; 194S,T; 288L,I,F.

A preferred model phytase for these concepts of (D) and 
(E) is P_lycii. 

The following has been experimentally proven to have a lowered
pH optimum: variant 80A of ascomycete phytases, in particular of
A_fumigatus and consphyA.

Especially preferred single, double and triple variants
are 43L; (43L/270L) and (43L/270L/273D). These variants have a
changed pH profile. They are preferably variants of the specific
model phytases listed in Fig. 1.

For all preferred variants listed above:

the stability is preferably amended at high temperature,
viz. in the temperature range of 50-100°C, in particular
60-90°C, more preferably in the range of 70-90°C;

the activity is preferably amended in a temperature range
relevant for the use in the gastro-intestinal system of animals,
e.g. 30-40°C, more preferably 32-38°C, most preferably in the
range of 35-38°C;

the stability is preferably amended at low pH, viz. in the
pH range of pH 1.5-7, preferably 2-6, more preferably 3-5;

the activity is preferably amended in the pH range of pH
1.5-5.5, more preferably at pH 2.5-4.5, still more preferably
3-5.



Tests for amended phytase characteristics, such as those
mentioned above, are well known in the art and any such test can
be used to compare the performance of the phytase variants with
the phytase models.

A preferred test for specific activity is given in Example
2. Preferred tests for pH and temperature activity and stability
are given in Example 3. An even more preferred test for thermal
stability is the DSC method of Example 4.

WO 98/28409 discloses tests for various other parameters, 
too, such as position specificity. All the tests of WO 98/28409 
are preferred tests. 

Generally, of course all these tests can be conducted at
desired pH values and temperatures.

In the dependent claims, some preferred phytase variants
based on five of the thirteen herein specifically disclosed
model phytases are specified.

In an analogous way other preferred variants based on the
remaining eight specifically disclosed model phytases can easily
be deduced by combining the suggested amendments with each of
the corresponding sequences of Fig. 1. These preferred variants
are specifically included in the present invention, and they are
easily deduced, viz. the following:

Variants of a model phytase derived from Paxillus,
preferably Paxillus involutus, preferably derived from strain
CBS 100231, preferably variants of P_involtus-A1, the sequence
of which is shown at Fig. 2, said variants comprising at least
one of the following amendments:

024C; T27P; F31Y; I33C; R39H,S,Q; N40L; S42G;
P43A,C,D,E,F,G,H,I,K,L,M,N,Q,R,S,T,V,W,Y; Y44N; S45D;
Y47F; A51E,R; A58D,K; Q61R; I62V; F75W; S78D; A80K; T81Q,E,G,A;
R83A,I,Q,K; I84Y,Q,V; L88I; K90R,A; F102Y; S115N; D116S; V118L;
P119E; F120L; A123N,T,Q; S125M; F126H,S,V; D127Q,E,N; A128T,S;
A132F,I,L; I148V; D151A,S; S153D,Y; D154Q,S,G; D158A; S159T;
A160S; T161N; ()170fH; ()170gA; S171N; H172P; N173Q,S; P184Q,S;
Q185S; T186A,E,P; G187A; ()187aS; T190P,A; D193S; N194S,T;
M195T,V,L; A198N,V; G200V; D201E; ()201eT; S202A; D203R,K,S;
P203aV,T; Q204E,S,A,V; V205E; V211L; S215A,I; L220N; A223D,H;
D233E; F235Y,L,T; N236Y; L237F; V238L,M; A242P,S; M244D;
()251eE,Q; D253P; T256D; P260A,H; E264R,I; A265Q; A267D;
G270Y,A,L; D271N; D273K; F275Y; T278H; Y280A,P; E283P; V287A,T;
Q288L,I,F; Y292F; V293A; N302R,H; A304P; N336S; L337T,Q,S,G;
M338I; V339I; A340P; S343A,F,I,L; F348Y; R349P; A352K; P360R;
R362P; W364F; R365V,L,A,S; T366D,V,S; S367K,A; S368K; L369I;
S373A; G374A,S; R375H; ()383kQ,E; T387P; Q396R; G404A; L409R;
T411K; L412R; E417R; F421Y.

Variants of a model phytase derived from a species of the
genus Paxillus, preferably the species Paxillus involutus,
preferably derived from strain CBS 100231, preferably variants
of P_involtus-A2, the sequence of which is shown at Fig. 3, said
variants comprising at least one of the following amendments:
P24C; I27P; F31Y; I33C; R39H,S,Q; N40L; S42G;
P43A,C,D,E,F,G,H,I,K,L,M,N,Q,R,S,T,V,W,Y; Y44N; S45D;
Y47F; A51E,R; A58D,K; E61R; I62V; F75W; S78D; A80K; A81Q,E,G;
R83A,I,Q/R,K; I84Y,Q,V; L88I; K90R,A; F102Y; S115N; D116S;
V118L; P119E; F120L; A123N,T,Q; S125M; F126H,S,V; D127Q,E,N;
A128T,S; V132F,I,L; D143N; I148V; D151A,S; S153D,Y; D154Q,S,G;
D158A; A160S; T161N; ()170fH; ()170gA; S171N; R172P; N173Q,S;
P184Q,S; Q185S; T186A,E,P; G187A; ()187aS; T190P,A; D193S;
N194S,T; M195T,V,L; A198N,V; G200V; E201D; ()201eT; S202A;




D203R,K,S; P203aV,T; Q204E,S,A,V; V205E; S211L,V; S215A,P;
L220N; A223D,H; A232T; F235Y,L,T; N236Y; L237F; V238L,M; P242S;
M244D; ()251eE,Q; D253P; T256D; P260A,H; E264R,I; A265Q; A267D;
G270Y,A,L; D271N; D273K; F275Y; T278H; Y280A,P; A283P; V287A,T;
Q288L,I,F; Y292F; I293A,V; N302R,H; A304P; N336S; L337T,Q,S,G;
M338I; V339I; 340P,A; A343S,F,I,L; F348Y; R349P; A352K; P360R;
R362P; W364F; L365V,A,S; T366D,V,S; S367K,A; S368K; V369I,L;
S373A; R375H; ()383kQ,E; T387P; Q396R; G404A; L409R; A411K,T;
L412R; E417R; Y421F.

Variants of a model phytase derived from a species of the
genus Trametes, preferably the species Trametes pubescens,
preferably derived from strain CBS 100232, preferably variants
of T_pubescens, the sequence of which is shown at Fig. 4, said
variants comprising at least one of the following amendments:
R24C; T27P; L31Y; V33C; Q39H,S; S40L,N; S42G;
M43A,C,D,E,F,G,H,I,K,L,N,P,Q,R,S,T,V,W,Y; Y44N; S45D;
Y47F; A51E,R; A58D,K; S59G; Q61R; I62V; F75W; S78D; A80K;
A81Q,E,G; R83A,I,Q,K; I84Y,Q,V; V88I; K90R,A; L102Y; D115N;
V118L; T123N,Q; S125M; S126H,V; E127Q,N; A128T,S; A132F,I,L;
D143N; V148I; S151A; S153D,Y; D154Q,S,G; A158D; A160S; N161T;
()170fH; ()170gA; S171N; S172P; N173Q,S; S184Q,P; E185S;
A186E,P; G187A; ()187aS; T190P,A; N194S,T; M195T,V,L; A198N,V;
G200V; ()201eT; S202A; D203R,K,S; P203aV,T; Q204E,S,A,V; V205E;
Q211L,V; P215A; L220N; G223D,H; D233E; Y235L,T; N236Y; L237F;
L238M; P242S; E244D; ()251eE,Q; E253P; Q260A,H; D264R,I; A265Q;
A267D; A270Y,L,G; D271N; D273K; F275Y; T278H; Y280A,P; V287A,T;
Q288L,I,F; Y292F; I293A,V; A302R,H; N304P,A; N336S; Q337T,S,G;
M338I; V339I; A340P; S343A,F,I,L; F348Y; N349P; A352K; P360R;
R362P; F364W; L365V,A,S; V366D,S; K367A; I369L; A373S; A374S;
R375H; ()383kQ,E; Q387P; A396R; G404A; V409R; T411K; L412R;
E417R; Y421F.

Variants of a model phytase derived from a species of the
genus Aspergillus, preferably the species Aspergillus nidulans,
preferably derived from strain DSM 9743, preferably variants of
A_nidulans, the sequence of which is shown at Fig. 10, said
variants comprising at least one of the following amendments:
V24C; A27P; H39S,Q; V40L,N; G42S;
Q43A,C,D,E,F,G,H,I,K,L,M,N,P,R,S,T,V,W,Y; Y44N; S45D;
Y47F; S49P; E51A,R; V56P; H58D,K,A; E61R; V62I; S69Q; Y75W,F;
E78D,S; S79G; K80A; S81Q,E,A,G; K82T; A83I,Q,K,R; Y84Q,V,I;
A90R; D115N; D116S; T118V,L; I119E; F120L; E122A; N123T,Q;
M125S; V126H,S; D127Q,E,N; S128A,T; F132I,L; K143N; I148V;
S151A; S153D,Y; D154Q,S,G; A158D; S159T; A160S; E161T,N; K162N;
F163W; G170fH; S170gA; ()171N; ()172P; K173Q,S; P184Q,S; E185S;
I186A,E,P; D187A; G187aS; T190P,A; H193S; S194T; S198A,N,V;
E200G,V; N201D,E; D201d(); E201e(),T; R201f() (a deletion of at
least one of 201d, 201e, 201f, preferably all); A202S;
D203R,K,S; E203aV,T; I204Q,E,S,A,V; I211L,V; P215A; L220N;
D223H; K228N; E232T; N233E; I235Y,L,T; Y236N; L237F; M238L;
S242P; M246V; E251eQ; A256D; E260A,H; L264R,I; Q270Y,A,L,G;
S271D,N; S273D,K; Y275F; G278T,H; A280P; A287T; Q288L,I,F;
F292Y; T293A,V; Q302R,H; P304A; N336S; S337T,Q,G; M338I; I339V;
S340P,A; F343A,S,I,L; N349P; Q352K; S360R; Q362P; Y364W,F;
A365V,L,S; A366D,V,S; S367K,A; W368K; T369I,L; G373S,A; A374S;
R375H; A376M; E383kQ; A404G; T411K; L412R; E417R; F421Y; K431E.

Variants of a model phytase derived from a species of
Aspergillus, preferably Aspergillus terreus, preferably derived
from strain CBS 220.95, preferably variants of A_terreus, the
sequence of which is shown at Fig. 12, said variants comprising
at least one of the following amendments:
G24C; V27P; H39S,Q; K40L,N; G42S;
L43A,C,D,E,F,G,H,I,K,M,N,P,Q,R,S,T,V,W,Y; Y44N; A45D,S;
Y47F; S49P; Q51E,A,R; V56P; P58D,K,A; D59G; H61R; I62V; A69Q;
S75W,F; H78D,S; S79G; K80A; T81Q,E,A,G; A83I,Q,K,R; Y84Q,V,I;
A90R; E115N; E116S; T118V,L; P119E; F120L; R122A; N123T,Q;
L125S,H; R126H,S,V; D127Q,E,N; L128A,T,S; F132I,L; H143N; V148I;
T151A,S; D152G; A153D,Y; S154D,Q,G; H157V; E158D,A; S159T;
A160S; E161T,N; K162N; F163W; H173Q,S; P184Q,S; E185S;
G186A,E,P; S187A; A187aS; T190P,A; H193S; S194T; L195T,V;
A198N,V; E200G,V; S201D,E; S201d(); T201e(); V201f(); G202S,A;
D203R,K,S; D203aV,T; A204Q,E,S,V; V205E; V211L; A215P; L220N;
D223H; Q228N; D232T; D233E; V235Y,L,T; N236Y; L237F; M238L;
P242S; E244E; T251eE,Q; A260H; T264R,I; Q265A; N267D; L270Y,A,G;
S271D,N; K273D; Y275F; H278T; G280A,P; V287A,T; Q288L,I,F;
W292F,Y; A293V; Q302H; P304A; N337T,Q,S,G; L338I; V339I;
S340P,A; W343A,S,F,I,L; N349P; A352K; S360R; S362P; Y364W,F;
A365V,L,S; A366D,V,S; A367K; W368K; T369I,L; A373S; A374S;
R375H; A376M; R383kQ,E; P404A,G; K411T; A417E,R; F421Y; A431E.

Variants of a model phytase derived from a species of 
Talaromyces, preferably the species Talaromyces thermophilus, 
preferably derived from strain ATCC 20186 or ATCC 74338,
preferably variants of T_thermo, the sequence of which is shown 
at Fig. 13, said variants comprising at least one of the 
following amendments: 
H24C; V27P; H39S,Q; S40L,N; G42S;
Q43A,C,D,E,F,G,H,I,K,L,M,N,P,R,S,T,V,W,Y; Y44N; S45D;
F47Y; S49P; A51E,R; V56P; Q58D,K,A; N59G; K61R; I62V; Y75W,F;
S78D; S79G; K80A; T81Q,E,A,G; E82T; L83A,I,Q,R,K; Y84Q,V,I;
R90A; D116S; T118V,L; P119E; F120L; E122A; N123T,Q; M125S;
I126H,S,V; Q127E,N; L128A,T,S; F132I,L; V148I; S151A; S153D,Y;
D154Q,S,G; I157V; A158D; S159T; G160A,S; R161T,N; L162N; F163W;
S170gA; D171N; K172P; H173Q,S; E184Q,S,P; E185S; G186A,E,P;
D187A; T190P,A; T193S; G194S,T; S195T,V,L; V198A,N; E200G,V;
D201E; S201d(); S201e(),T; S201f(); G202S,A; H203R,K,S;
D203aV,T; A204Q,E,S,V; Q205E; Q211L,V; A215P; I220N,L; H223D;
D228N; S232T; D233E; P235Y,L,T; Y236N; M237F; D238L,M; P242S;
E244D; L246V; ()251eE,Q; A256D; Q260A,H; Q264R,I; A265Q;
Q270Y,A,L,G; S271D,N; G273D,K; Y275F; N278T,H; G280A,P; A287T;
Q288L,I,F; F292Y; V293A; H302R; P304A; N336S; T337Q,S,G; M338I;
T339V,I; S340P,A; A343S,F,I,L; N349P; A352K; S360R; E362P;
Y364W,F; S365V,L,A; A366D,V,S; A367K; W368K; T369I,L; G373S,A;
G374A,S; R375H; A376M; D383kQ,E; E404A; K411T; R417E; F421Y.

Variants of a model phytase derived from a species of 
Thermomyces, preferably the species Thermomyces lanuginosus, 
preferably derived from strain CBS 586.94, preferably variants
of T_lanuginosa, the sequence of which is shown at Fig. 14, said
variants comprising at least one of the following amendments:
K24C; ()27P; ()31Y; ()33C; R39H,S,Q; H40L,N; G42S;
Q43A,C,D,E,F,G,H,I,K,L,M,N,P,R,S,T,V,W,Y; Y44N; S45D;
F47Y; S49P; A51E,R; V56P; K58D,A; V62I; S69Q; Y75W,F; A78D,S;
H79G; K80A; S81Q,E,A,G; E82T; V83A,I,Q,K,R; Y84Q,V,I; L88I;
R90A; F102Y; D115N; N116S; T118V,L; R119E; F120L; E122A;
E123N,T,Q; M125S; M126H,S,V; E127Q,N; S128A,T; F132I,L; E143N;
V148I; A151S; S153D,Y; A154D,Q,S,G; I157V; A158D; S159T; A160S;
E161T,N; F162N; F163W; R170fH; S170gA; K172P; D173Q,S; S184Q,P;
E185S; E186A,P; T187A; G187aS; T190P,A; G193S; L194S,T; T195V,L;
A198N,V; E200G,V; E201D; A201d(); P201e(),T; D202S,A; P203R,K,S;
T203aV; Q204E,S,A,V; P205E; V211L; R215A,P; I220L,N; H223D;
E232T; D233E; P235Y,L,T; L236Y,N; M238L; P242S; Q251eE; H256D;
Q260H; M264R,I; A265Q; Y270A,L,G; T271D,N; D273K; Y275F; H278T;
G280A,P; A283P; S287A; R288L,I,F; F292Y; V293A; G302R,H; P304A;
N336S; T337Q,S,G; M338I; T339V,I; G340P,A; S343A,F,I,L; N349P;
P360R; T362P; Y364W,F; A365V,L,S; A366D,V,S; S367K,A; W368K;
T369I,L; A373S; A374S; R375H; A376M; E383kQ; R404A,G; R411K,T;
K417E,R; F421Y; D431E.

Variants of a model phytase derived from a species of 
Myceliophthora, preferably the species Myceliophthora 
thermophila, preferably derived from strain ATCC 48102 or ATCC
74340, preferably variants of M_thermophila, the sequence of 
which is shown at Fig. 7, said variants comprising at least one 
of the following amendments: 
S24C; F31Y; H39S,Q; F40L,N; G42S;
Q43A,C,D,E,F,G,H,I,K,L,M,N,P,R,S,T,V,W,Y; Y44N; S45D;
Y47F; S49P; P51E,A,R; I56P; D58K,A; D59G; E61R; V62I; S69Q;
A75W,F; L78D,S; K79G; R80K,A; A81Q,E,G; A82T; S83A,I,Q,K,R;
Y84Q,V,I; R90A; D115N; E116S; T118V,L; R119E; T120L; Q122A;
Q123N,T; M125S; V126H,S; N127Q,E; S128A,T; F132I,L; K143N;
V148I; A151S; Q153D,Y; D154Q,S,G; H158D,A; S159T; A160S;
E161T,N; G170fH; S170gA; T171N; F163W; V172P; R173Q,S; P184Q,S;
E185S; T186A,E,P; G187aS; T190P,A; N193S; D194S,T; L195T,V;
A198N,V; E200G,V; E201D; G201a(); P201b(); Y201c(); S201d();
T201e(); I201f(); G202S,A; D203R,K,S; D203aV,T; A204Q,E,S,V;
Q205E; 121^^; P215A; V220N,L; N223D,H; A232T; D233E;
V235Y,L,T; A236Y,N; L237F; M238L; P242S; E244D; A251eE,Q; R256D;
E260A,H; R264I; A265Q; Q270Y,A,L,G; S271D,N; K273D; Y275F;
Y278T,H; P280A; T287A; Q288L,I,F; F292Y; V293A; ()302R,H; P304A;
N336S; D337T,Q,S,G; M338I; M339V,I; G340P,A; G343A,S,F,I,L;
D349P; P352K; D360R; E362P; Y364W,F; A365V,L,S; A366D,V,S;
S367K,A; W368K; A369I,L; A373S; A374S; R375H; I376M; E383kQ;
E387P; G404A; M409R; T411K; L412R; E417R; F421Y; D431E.

This invention also provides a new phytase which has been
derived from a strain of Cladorrhinum, viz. C. foecundissimum.
Accordingly, the invention also relates to a polypeptide having
phytase activity and which comprises SEQ ID NO: 2 or the mature
part (amino acids nos. 16-495) thereof; or a polypeptide being at
least 70, more preferably 75, 80, 85, 90, 95% homologous
thereto; homology meaning similarity, preferably identity, and
being determined using the program GAP and the settings as
defined hereinabove. The invention further relates to a DNA
construct which encodes a polypeptide having phytase activity,
said DNA construct comprising a DNA molecule which comprises
SEQ ID NO: 1 or nucleotides nos. 20-70 and 207-1560 thereof; or
nucleotides nos. 20-70 and 207-1563 thereof; or nucleotides nos.
65-70 and 207-1560 thereof; or nucleotides nos. 65-70 and
207-1563 thereof; or a DNA construct or molecule which is at least
70, 75, 80, 85, 90, 95% homologous to either of these
nucleotide sequences; homology meaning similarity, preferably
identity, and being determined using computer programs known in
the art such as GAP provided in the GCG program package (Program
Manual for the Wisconsin Package, Version 8, August 1996,
Genetics Computer Group, 575 Science Drive, Madison, Wisconsin,
USA 53711) (Needleman, S.B. and Wunsch, C.D., (1970), Journal
of Molecular Biology, 48, 443-453), using GAP with the following
settings for DNA sequence comparison: GAP creation penalty of
5.0 and GAP extension penalty of 0.3. The invention also relates
to a DNA construct which hybridizes with any of the above DNA
sequences under the conditions mentioned hereinabove.

EXAMPLES
Example 1

Phytase activity assay (FYT)

Phytase activity can be measured using the following assay:

10 µl diluted enzyme samples (diluted in 0.1 M sodium acetate,
0.01 % Tween20, pH 5.5) are added into 250 µl 5 mM sodium
phytate (Sigma) in 0.1 M sodium acetate, 0.01 % Tween20, pH 5.5
(pH adjusted after dissolving the sodium phytate; the substrate
is preheated) and incubated for 30 minutes at 37 °C. The reaction
is stopped by adding 250 µl 10 % TCA and free phosphate is
measured by adding 500 µl 7.3 g FeSO4 in 100 ml molybdate
reagent (2.5 g (NH4)6Mo7O24·4H2O in 8 ml H2SO4 diluted to 250 ml).
The absorbance at 750 nm is measured on 200 µl samples in 96
well microtiter plates. Substrate and enzyme blanks are
included. A phosphate standard curve is also included (0-2 mM
phosphate). 1 FYT equals the amount of enzyme that releases 1
µmol phosphate/min at the given conditions.
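
The arithmetic behind the FYT unit can be sketched as follows. This is a minimal Python sketch and not part of the assay as described: the function names are hypothetical, and it assumes that the phosphate standards are carried through the same TCA and colour-reagent steps as the samples, so that the fitted line reads phosphate concentration in the 260 µl reaction mixture (10 µl enzyme plus 250 µl substrate).

```python
def fit_standard_curve(concs_mm, absorbances):
    """Least-squares line A750 = slope * [phosphate, mM] + intercept."""
    n = len(concs_mm)
    mean_c = sum(concs_mm) / n
    mean_a = sum(absorbances) / n
    sxx = sum((c - mean_c) ** 2 for c in concs_mm)
    sxy = sum((c - mean_c) * (a - mean_a)
              for c, a in zip(concs_mm, absorbances))
    slope = sxy / sxx
    return slope, mean_a - slope * mean_c


def fyt_per_ml(a_sample, a_blank, slope, dilution,
               reaction_volume_ul=260.0, enzyme_volume_ul=10.0,
               minutes=30.0):
    """FYT per ml of undiluted enzyme (1 FYT = 1 µmol phosphate/min).

    Assumption: the standard curve reads mM phosphate in the
    reaction mixture, and blanks are subtracted as absorbance.
    """
    phosphate_mm = (a_sample - a_blank) / slope      # mM = µmol/ml
    umol_released = phosphate_mm * reaction_volume_ul / 1000.0
    fyt_in_well = umol_released / minutes
    return fyt_in_well / (enzyme_volume_ul / 1000.0) * dilution


# Hypothetical readings, for illustration only.
slope, intercept = fit_standard_curve([0.0, 0.5, 1.0, 2.0],
                                      [0.05, 0.30, 0.55, 1.05])
activity = fyt_per_ml(0.55, 0.05, slope, dilution=100.0)
```

If the standards are prepared differently in practice, only the volume factor changes; the unit definition stays the same.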

Example 2 

Test for specific activity

The specific activity can be determined as follows:

A highly purified sample of the phytase is used (the
purity is checked beforehand on an SDS polyacrylamide gel
showing the presence of only one component).

The protein concentration in the phytase sample is
determined by amino acid analysis as follows: An aliquot of the
phytase sample is hydrolyzed in 6N HCl, 0.1% phenol for 16 h at
110 °C in an evacuated glass tube. The resulting amino acids are
quantified using an Applied Biosystems 420A amino acid analysis
system operated according to the manufacturer's instructions.
From the amounts of the amino acids the total mass - and thus
also the concentration - of protein in the hydrolyzed aliquot
can be calculated.

The activity is determined in the units of FYT. One FYT
equals the amount of enzyme that liberates 1 micromol inorganic
phosphate from phytate (5 mM phytate) per minute at pH 5.5,
37 °C; assay described e.g. in example 1.

The specific activity is the value of FYT/mg enzyme
protein.
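
The two calculations in this example can be sketched in Python as below. The helper names are hypothetical; the residue masses are standard average values, and the sketch ignores the one water molecule per chain and treats Asn/Gln as reported by the analyser, which is an assumption made for illustration only.

```python
# Average residue masses (g/mol), i.e. amino acid minus water.
RESIDUE_MASS = {
    "G": 57.05, "A": 71.08, "S": 87.08, "P": 97.12, "V": 99.13,
    "T": 101.10, "C": 103.14, "L": 113.16, "I": 113.16, "N": 114.10,
    "D": 115.09, "Q": 128.13, "K": 128.17, "E": 129.12, "M": 131.19,
    "H": 137.14, "F": 147.18, "R": 156.19, "Y": 163.18, "W": 186.21,
}


def protein_mass_ug(residue_nmol):
    """Total protein mass (µg) from amino acid amounts (nmol each)."""
    nanograms = sum(nmol * RESIDUE_MASS[aa]
                    for aa, nmol in residue_nmol.items())
    return nanograms / 1000.0


def specific_activity(fyt_per_ml, protein_mg_per_ml):
    """Specific activity in FYT per mg enzyme protein."""
    if protein_mg_per_ml <= 0:
        raise ValueError("protein concentration must be positive")
    return fyt_per_ml / protein_mg_per_ml
```

For example, a hypothetical sample with 53.0 FYT/ml and 2.0 mg/ml protein has a specific activity of 26.5 FYT/mg.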

Example 3 

Test for temperature and pH activity and stability

Temperature and pH activity and stability can be
determined as follows:

Temperature profiles (i.e. temperature activity
relationship) by running the FYT assay of Example 1 at various
temperatures (preheating the substrate).

Temperature stability by pre-incubating the phytase in 0.1
M sodium phosphate, pH 5.5 at various temperatures before
measuring the residual activity.

The pH-stability by incubating the enzyme at pH 3 (25 mM
glycine-HCl), pH 4-5 (25 mM sodium acetate), pH 6 (25 mM MES),
pH 7-9 (25 mM Tris-HCl) for 1 hour at 40°C, before measuring the
residual activity.

The pH-profiles (i.e. pH activity relationship) by running
the assay at the various pH using the same buffer-systems (50
mM, pH re-adjusted when dissolving the substrate).
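
Residual activity and activity profiles are simple ratios; a minimal Python sketch follows. The function names are hypothetical, and normalising a profile to its highest measured value is an assumption about how such profiles are usually presented, not something stated above.

```python
def residual_activity_percent(activity_after, activity_untreated):
    """Activity left after pre-incubation, as % of the untreated sample."""
    if activity_untreated <= 0:
        raise ValueError("untreated activity must be positive")
    return 100.0 * activity_after / activity_untreated


def relative_profile(activity_by_condition):
    """Scale a pH or temperature profile so that the maximum is 100 %."""
    peak = max(activity_by_condition.values())
    return {condition: 100.0 * activity / peak
            for condition, activity in activity_by_condition.items()}
```

With hypothetical activities of 10.0 at pH 3.0 and 40.0 at pH 5.5, relative_profile reports 25 % at pH 3.0.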

Example 4 

DSC as a preferred test for thermostability

The thermostability or melting temperature, Tm, can be
determined as follows:

In DSC the heat consumed to keep a constant temperature
increase in the sample-cell is measured relative to a reference
cell. A constant heating rate is kept (e.g. 90°C/hour). An
endothermal process (heat consuming process - e.g. the unfolding of
an enzyme/protein) is observed as an increase in the heat
transferred to the cell in order to keep the constant
temperature increase.

DSC can be performed using the MC2-apparatus from
MicroCal. Cells are equilibrated 20 minutes at 20°C before
scanning to 90°C at a scan rate of 90°/h. Samples of e.g. around
2.5 mg/ml phytase in 0.1 M sodium acetate, pH 5.5 are loaded.
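
How a Tm might be read from such a scan can be sketched as follows. This is an illustrative Python sketch under stated assumptions, not the instrument's own evaluation: the function name is hypothetical, the baseline is taken as a straight line through the first and last points, and Tm is taken as the temperature of the largest baseline-corrected signal.

```python
def melting_temperature(temps_c, heat_signal):
    """Temperature of the endothermic peak after linear baseline removal."""
    if len(temps_c) != len(heat_signal) or len(temps_c) < 3:
        raise ValueError("need two equally long series of >= 3 points")
    t0, t1 = temps_c[0], temps_c[-1]
    y0, y1 = heat_signal[0], heat_signal[-1]
    slope = (y1 - y0) / (t1 - t0)
    # Subtract the straight baseline drawn between the scan end points.
    corrected = [y - (y0 + slope * (t - t0))
                 for t, y in zip(temps_c, heat_signal)]
    peak_index = max(range(len(corrected)), key=corrected.__getitem__)
    return temps_c[peak_index]
```

Real thermograms usually need smoothing and a fitted baseline; the sketch only shows where the number comes from.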



Example 5 

Phytase variants of amended activity characteristics 

Variants of an Aspergillus fumigatus model phytase (a wild
type phytase derived from strain ATCC 13073) were prepared as
described in EP 98104858.0 (EP-A-0897010), examples 2-3 and 5,
and the phytase activity was determined as described in example
7 thereof. pH and temperature optimum and melting point were
determined as described in examples 9 and 10 of EP 98113176.6
(EP-A-0897985).

In Table 1, variants of improved specific activity at pH
5.0 are listed. Table 2 lists variants of improved relative
activity at pH 3.0, and Table 3 lists variants of improved
thermostability (temperature optimum, e.g. determined by DSC).


Table 1

Amended in position no.    Substitution into    Specific activity at pH 5.0 (U/mg)
43                         43L                  83.4
                           43N                  45.5
                           43T                  106.9
                           43I                  91.2
                           43V                  35.0
                           43A                  27.3
                           43G                  59.6
43 and 270                 43L, 270L            88.7
43 and 270 and 273         43L, 270L, 273D      92.3
43 and 78                  43L, 78D             118.5
43 and 153 and 154         43L, 153Y, 154G      193.0
A. fumigatus wild-type                          26.5
phytase
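
To put the Table 1 values in proportion, each variant can be expressed as a fold change over the wild type. The Python sketch below only re-uses figures from Table 1; the helper name is hypothetical and the entry read as "43I" assumes that the printed "431" is an isoleucine substitution.

```python
WILD_TYPE_U_PER_MG = 26.5  # A. fumigatus wild-type phytase, Table 1

TABLE_1_U_PER_MG = {
    "43L": 83.4, "43N": 45.5, "43T": 106.9, "43I": 91.2,
    "43V": 35.0, "43A": 27.3, "43G": 59.6,
    "43L, 270L": 88.7, "43L, 270L, 273D": 92.3,
    "43L, 78D": 118.5, "43L, 153Y, 154G": 193.0,
}


def fold_improvement(variant_activity, reference_activity):
    """Ratio of a variant's specific activity to the reference value."""
    if reference_activity <= 0:
        raise ValueError("reference activity must be positive")
    return variant_activity / reference_activity


folds = {name: round(fold_improvement(value, WILD_TYPE_U_PER_MG), 2)
         for name, value in TABLE_1_U_PER_MG.items()}
```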




Table 2

Amended in position no.    Substitution into    Relative phytase activity at pH 3.0
205                        205E                 41%
273                        273K                 61%
278                        278H                 75%
273 and 205                273K, 205E           65%
273 and 278                273K, 278H           100%
273 and 205 and 278        273K, 205E, 278H     96%
A. fumigatus wild-type                          32%
phytase




Table 3

Amended in position no.          Substitution into             Temperature     Tm (°C)
                                                               optimum (°C)    (DSC)
43 and 47 and 88 and 102 and     43T, 47Y, 88I, 102Y, 220L,    60              67
220 and 242 and 267              242P, 267D
as above plus 51 and 302 and     as above plus 51A, 302H,      63
337 and 373 and 115              337T, 373A, 115N
A. fumigatus wild-type phytase                                 55              62.5



Example 6 

Further phytase variants of amended activity characteristics 

Variants of the ascomycete consensus sequence "conphys" of
Fig. 9 were prepared as described in EP 98113176.6
(EP-A-0897985), examples 4-8. Phytase activity, including pH and
temperature optimum, and melting point were determined as
described in examples 9 and 10, respectively, thereof.

The tables below list variants of amended activity
characteristics, viz.

Table 4 variants of improved specific activity at pH 6.0;

Table 5 variants of amended pH optimum (the pH-optimum
indicated is an approximate value, determined as that pH-value
(selected from the group consisting of pH 4.0; 4.5; 5.0; 5.5;
6.0; 6.5; and 7.0) at which the maximum phytase activity was
obtained);

Table 6 a variant of improved thermostability (expressed
by way of the melting point as determined by differential
scanning calorimetry (DSC)); and

Table 7 variants of amended thermostability (temperature
optimum); a "+" or "-" indicates a positive or a negative,
respectively, effect on temperature optimum of up to 1°C; and a
"++" or "--" means a positive or a negative, respectively,
effect on temperature optimum of between 1 and 3°C.
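
The two conventions used for Tables 5 and 7 can be written down compactly. The Python sketch below is illustrative only: the function names are hypothetical, the activity values a caller passes in would be measured data, and the treatment of a shift of exactly 1°C (counted as "up to 1°C") and of shifts above 3°C (rejected) are assumptions, since the text does not cover them.

```python
PH_GRID = (4.0, 4.5, 5.0, 5.5, 6.0, 6.5, 7.0)


def approximate_ph_optimum(activity_by_ph):
    """Grid pH at which the highest phytase activity was measured."""
    missing = [ph for ph in PH_GRID if ph not in activity_by_ph]
    if missing:
        raise ValueError(f"no activity measured at pH {missing}")
    return max(PH_GRID, key=lambda ph: activity_by_ph[ph])


def classify_temperature_shift(delta_c):
    """Map a change in temperature optimum (°C) to the table symbols."""
    if delta_c == 0:
        return "0"
    sign = "+" if delta_c > 0 else "-"
    size = abs(delta_c)
    if size <= 1.0:
        return sign            # effect of up to 1°C
    if size <= 3.0:
        return sign * 2        # effect of between 1 and 3°C
    raise ValueError("shifts above 3°C are not covered by the symbols")
```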


Table 4

Amended in position no.    Substitution into    Specific activity at pH 6.0 (U/mg)
43                         43T                  130
                           A *3T                one
Conphys                                         62

Table 5

Amended in position no.    Substitution into    pH optimum around
43                         43T                  6.0
                           43L                  5.5
                           43G                  6.5
43 and 44                  43L, 44N             6.0
                           43T, 44N             5.5
Conphys                                         6.0

Table 6

Amended in position no.    Substitution into    Tm (°C)
43                         43T                  78.9
Conphys                                         78.1

Table 7

Amended in position no.    Substitution into    Temperature optimum amendment
R1 Zj X                    7V f\                4-
D O                        IS.                  i
                           IN                   4.
195                        L                    ++
201e                       T                    ++
244                        D                    +
264                        I                    +
302                        H                    +
337                        T                    ++
352                        K                    +
373                        A                    ++
47                         F
62                         I
83                         K
90                         R
143                        N
148                        V
186                        A
187a                       S
198                        V
204                        A
211                        V
215                        P
251e                       Q
260                        A
265                        A
339                        V
365                        A
383k                       E
404                        G
417                        R
Conphys                                         0
Table 8

Amended in position no.        Substitution into           Tm (°C) (DSC)    Specific activity at
                                                                            pH 5.0 (U/mg)
43 and 51 and 220 and 244      51A, 220N, 244D, 264I,      84.7             105
and 264 and 302 and 337        302H, 337T, 352K, 373A,
and 352 and 373                43T
as above plus 80               as above plus 80A           85.7             180
Conphys                                                    78.1             30



Example 7 

Cloning of a phytase of Cladorrhinum foecundissimum

DNA encoding a phytase from Cladorrhinum foecundissimum
CBS 427.97 has been cloned, and the enzyme isolated and
purified, essentially as described in WO 98/28409.

Fig. 15 shows the DNA sequence of the HindIII/XbaI cloned
PCR product in pA2phy8. The cloned PCR product is amplified from
the genomic region encoding Cladorrhinum foecundissimum CBS
427.97 phyA gene. The putative intron is indicated by double
underline of the excision-ligation points in accordance with the
GT-AG rule (R. Breathnach et al. Proc. Natl. Acad. Sci. USA 75
(1978) pp. 4853-4857). The restriction sites used for cloning are
underlined.

According to the SignalP V1.1 prediction (Henrik Nielsen,
Jacob Engelbrecht, Søren Brunak and Gunnar von Heijne:
"Identification of prokaryotic and eukaryotic signal peptides
and prediction of their cleavage sites," Protein Engineering 10,
1-6 (1997)), the signal peptide part of the enzyme corresponds
to amino acids nos. 1-15; accordingly the mature enzyme is amino
acids nos. 16-495.

The enzyme exhibits a pH optimum around pH 6 with no
activity at the low pH (pH 3), but significant activity up until
pH 7.5; thus it is a more alkaline phytase as compared to the
Aspergillus ficuum phytase.

A temperature optimum around 60 °C was found at pH 5.5.
Thus, this phytase is more thermostable than the A. ficuum
phytase.


Example 8 

Alignment of a new model phytase according to Fig. 1

The phytase sequence of Cladorrhinum foecundissimum as
disclosed in Example 7 is compared with the 13 model phytases of
Fig. 1 using GAP version 8 referred to above with a GAP weight
of 3.000 and a GAP lengthweight of 0.100. Complete amino acid
sequences are compared. The M_thermophila phytase sequence turns
out to be the most homologous sequence, showing a degree of
similarity to the C. foecundissimum sequence of 70.86%.

Still using the GAP program and the parameters mentioned
above, the phytase sequence "C_foecundissimum" is now aligned to
the "M_thermophila" phytase - see Fig. 16. The average match is
0.540; the average mismatch -0.396; quality 445.2; length 505;
ratio 0.914; gaps 9; percent similarity 70.860; percent identity
53.878.
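
As an illustration of what a percent identity figure summarises, the Python sketch below counts identical residues over the aligned, gap-free columns of a pairwise alignment. It is not the GAP program: the function name is hypothetical, the choice of denominator (columns without a gap in either sequence) is an assumption, and percent similarity is left out because it depends on the scoring matrix used.

```python
GAP_CHARS = {"-", "."}


def percent_identity(aligned_a, aligned_b):
    """Percent identical residues over columns where neither row has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = identical = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a in GAP_CHARS or b in GAP_CHARS:
            continue  # gap columns are not counted
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        raise ValueError("no aligned residue pairs to compare")
    return 100.0 * identical / compared
```

For two short made-up rows, "MKT-LL" and "MRTALL", five columns are compared and four match, giving 80 %.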

In a next step, see Fig. 17, the C_foecundissimum sequence is
pasted (or it could simply be written) onto the alignment of
Fig. 1 as the bottom row, ensuring that those amino acid
residues which according to the alignment at Fig. 16 are
identical (indicated by a vertical line) or similar (indicated
by one or two dots) are placed above each other. At 5 places
along the sequence, the C_foecundissimum sequence comprises
"excess" amino acid residues, which the alignment of Fig. 1 does
not make room for. At Fig. 17, these excess residues are
transferred onto a next row (but they can be included in the
multiple alignment and numbered as described previously in the
position numbering related paragraphs, using the denotations a,
b, c etc.).

Corresponding variants of the phytase of C_foecundissimum
are then easily deduced on the basis of Fig. 17. Some examples:
The variants generally designated "80K,A" and "43T" in
C_foecundissimum correspond to "K80A" and "Q43T," respectively.
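
The deduction in the last paragraph can be sketched in Python. The function name and the residue lookup are hypothetical; the only residues used are K at position 80 and Q at position 43 of C_foecundissimum, as implied by the two examples above. A target equal to the residue already present is dropped, which is why "80K,A" becomes "K80A".

```python
import re

# Position label: digits plus an optional lower-case suffix (e.g. 201e).
_GENERIC = re.compile(r"^(\d+[a-z]?)(.+)$")


def specific_designation(generic, model_residues):
    """Turn a generic variant such as '80K,A' into a model-specific one.

    model_residues maps position labels to the residue found in the
    model phytase at that position of the alignment.
    """
    match = _GENERIC.match(generic.strip())
    if match is None:
        raise ValueError(f"unrecognised designation: {generic!r}")
    position, targets = match.group(1), match.group(2).split(",")
    original = model_residues[position]
    # A substitution into the residue already present is not a variant.
    kept = [t for t in targets if t != original]
    if not kept:
        return None
    return f"{original}{position}{','.join(kept)}"


c_foecundissimum = {"80": "K", "43": "Q"}
examples = [specific_designation(g, c_foecundissimum)
            for g in ("80K,A", "43T")]
```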
CLAIMS

1. A phytase variant which, when aligned according to Fig. 1,
is amended as compared to a model phytase in at least one of the
following positions, using the position numbering corresponding
to P_lycii:
24; 27; 31; 33; 39; 40; 41; 42; 43; 44; 45; 46; 47; 49; 51; 56;
58; 59; 61; 62; 68; 69; 70; 71; 72; 73; 74; 75; 76; 77; 78; 79;
80; 81; 82; 83; 84; 88; 90; 102; 115; 116; 117; 118; 119; 120;
121; 122; 123; 124; 125; 126; 127; 128; 132; 143; 148; 149; 150;
151; 152; 153; 154; 155; 156; 157; 158; 159; 160; 161; 162; 163;
170f; 170g; 171; 172; 173; 184; 185; 186; 187; 187a; 190; 191;
192; 193; 194; 195; 198; 199; 200; 201; 201a; 201b; 201c; 201d;
201e; 201f; 202; 203; 203a; 204; 205; 211; 215; 220; 223; 228;
232; 233; 234; 235; 236; 237; 238; 239; 242; 243; 244; 246;
251e; 253; 256; 260; 264; 265; 267; 270; 271; 272; 273; 274;
275; 276; 277; 278; 279; 280; 283; 285; 287; 288; 292; 293; 302;
304; 332; 333; 334; 335; 336; 337; 338; 339; 340; 341; 342; 343;
348; 349; 352; 360; 362; 364; 365; 366; 367; 368; 369; 370; 371;
372; 373; 374; 375; 376; 383k; 387; 393; 394; 396; 404; 409;
411; 412; 413; 417; 421; 431.

2. A phytase variant which, when aligned according to Fig. 1,
comprises at least one of the following amendments as compared
to a model phytase, using the position numbering corresponding
to the phytase of P_lycii:
24C; 27P; 31Y; 33C; 39H,S,Q; 40L,N; 42S,G;
43A,C,D,E,F,G,H,I,K,L,M,N,P,Q,R,S,T,V,W,Y; 44N; 45D,S; 47Y,F;
49P; 51E,A,R; 56P; 58D,K,A; 59G; 61R; 62V,I; 69Q; 75W,F; 78D,S;
79G; 80K,A; 81A,G,Q,E; 82T; 83A,I,K,R,Q; 84I,Y,Q,V; 88I; 90R,A;
102Y; 115N; 116S; 118V,L; 119E; 120L; 122A; 123N,Q,T; 125M,S;
126H,S,V; 127Q,E,N; 128A,S,T; 132F,I,L; 143N; 148V,I; 151A,S;
152G; 153D,Y; 154D,Q,S,G; 157V; 158D,A; 159T; 160A,S; 161T,N;
162N; 163W; 170fH; 170gA; 171N; 172P; 173Q,S; 184Q,S,P; 185S;
186A,E,P; 187A; 187aS; 190A,P; 193S; 194S,T; 195T,V,L; 198A,N,V;
200G,V; 201D,E; 201a(); 201b(); 201c(); 201d(); 201e(); 201f();
201eT; 202S,A; 203R,K,S; 203aV,T; 204Q,E,S,A,V; 205E; 211L,V;
215A,P; 220L,N; 223H,D; 228N; 232T; 233E; 235Y,L,T; 236Y,N;
237F; 238L,M; 242P,S; 244D; 246V; 251eE,Q; 253P; 256D; 260A,H;
264R,I; 265A,Q; 267D; 270Y,A,L,G; 271D,N; 273D,K; 275F,Y;
278T,H; 280A,P; 283P; 287A,T; 288L,I,F; 292F,Y; 293A,V; 302R,H;
304P,A; 332F; 336S; 337T,G,Q,S; 338I; 339V,I; 340P,A;
343A,S,F,I,L; 348Y; 349P; 352K; 360R; 362P; 364W,F; 365V,L,A,S;
366D,S,V; 367A,K; 368K; 369I,L; 370V; 373A,S; 374S,A; 375H;
376M; 383kQ,E; 387P; 393V; 396R; 404A,G; 409R; 411K,T; 412R;
417E,R; 421F,Y; 431E.

3. The phytase variant of any of claims 1 or 2, which is
derived from an ascomycete phytase.

4. The phytase variant of claim 3 which is derived from an
Aspergillus phytase.

5. The phytase variant of claim 4, wherein the model phytase
is a strain of Aspergillus niger, Aspergillus ficuum,
Aspergillus nidulans, Aspergillus fumigatus, or Aspergillus
terreus.

6. The phytase variant of claim 5 wherein the model phytase
is Aspergillus nidulans DSM 9743; or any of the following
strains of Aspergillus terreus: CBS 116.46, DSM 9076, CBS
220.95.

7. The phytase variant of claim 6 wherein the model phytase
is the Aspergillus nidulans phytase sequence shown in Fig. 10;
or the Aspergillus terreus phytase sequence shown in Fig. 12.

8. The phytase variant of claim 3 wherein the model phytase
is a strain of Thermomyces lanuginosus, Talaromyces
thermophilus, or Myceliophthora thermophila.

9. The phytase variant of claim 8 wherein the model phytase
is Thermomyces lanuginosus CBS 586.94; or any of the following
strains of Talaromyces thermophilus: ATCC 20186, ATCC 74338; or
any of the following strains of Myceliophthora thermophila: ATCC
34625, ATCC 74340.

10. The phytase variant of claim 9 wherein the model phytase
is the Thermomyces lanuginosus phytase sequence shown in Fig. 14;
or the Talaromyces thermophilus sequence shown in Fig. 13; or the
Myceliophthora thermophila phytase sequence shown in Fig. 7.

11. The phytase variant of claim 3 wherein the model phytase
is an ascomycete consensus phytase sequence.

12. The phytase variant of any of claims 1 or 2, which is
derived from a basidiomycete phytase.

13. The phytase variant of claim 12, wherein the model phytase
is a strain of Paxillus involutus, Trametes pubescens, Agrocybe
pediades, or Peniophora lycii.

14. The phytase variant of claim 13 wherein the model phytase
is Trametes pubescens CBS 100232 or Paxillus involutus CBS
100231.

15. The phytase variant of claim 14 wherein the model phytase
is the Trametes pubescens phytase sequence of Fig. 4 or either
of the Paxillus involutus phytase sequences of Figs. 2 and 3.

16. The phytase variant according to any of claims 1 or 2,
which comprises at least one of the following amendments:
R24C; V27P; H39Q,S; L40N; G42S;
Q43A,C,D,E,F,G,H,I,K,L,M,N,P,R,S,T,V,W,Y; Y44N; A45D,S; F47Y;
S49P; A51E,R; V56P; A58D,K; V62I; S69Q; Y75W,F; D78S; S79G;
K80A; G81A,Q,E; K82T; K83A,I,R,Q; Y84Q,I,V; E90R,A; D115N;
D116S; T118V,L; P119E; F120L; E122A; Q123N,T; L125S,M; V126H,S;
N127Q,E; S128A,T; F132I,L; I148V; S151A; S153D,Y; S154Q,D,G;
I157V; A158D; S159T; G160A,S; K161T,N; K162N; F163W; R170fH;
Q171N; G173Q,S; S184P,Q; E185S; A186E,P; S187A; T190P,A; P193S;
G194S,T; T195V,L; V198A,N; E200G,V; D201E; S201d(); E201e(),T;
L201f(); preferably all three deletions; A202S; D203R,K,S;
D203aV,T; V204Q,E,S,A; T211L,V; S215A,P; L220N; D223H; T228N;
T235Y,L; Y236N; L237F; M238L; S242P; I246V; K251eE,Q; H260A;
I264R; N265Q,A; Q270Y,A,L,G; S271D,N; K273D; Y275F; H278T;
A280P; T287A; Q288L,I,F; Y292F; A293V; H302R; P304A; N336S;
03373^,0; I339V; S340P,A; F343A,S,F,I,L; N349P; N360R; T362P;
F364W; S365V,L,A; S366D,V; A367K; W368K; T369I,L; A373S; S374A;
R375H; L376M; Q383kE; P404A,G; T411K; R417E; F421Y; A431E.

17. The phytase variant of claim 16, the model phytase of
which is an Aspergillus derived phytase, preferably derived from
Aspergillus ficuum or Aspergillus niger.

18. The phytase variant of claim 17, the model phytase of
which is a phytase derived from either of Aspergillus ficuum
(niger) NRRL 3135, Aspergillus niger ATCC 9142, or Aspergillus
niger ATCC 74337.

19. The phytase variant of claim 18, the model phytase of
which is the Aspergillus ficuum phytase sequence of Fig. 11.

20. The phytase variant according to any of claims 1 or 2,
which phytase variant comprises at least one of the following
amendments:
A24C; V27P; H39S,Q; L40N; G42S; Q43C,D,E,F,H,K,M,P,R,S,W,Y;
Y44N; S45D; F47Y; S49P; E51A,R; L56P; K58D,A; D59G; I62V; S69Q;
Y75W,F; S78D; S79G; K80A; S81A,G,Q,E; K82T; K83A,I,Q,R;
Y84Q,V,I; V88K; A90R; F102Y; D115N; D116S; T118V,L; P119E;
F120L; E122A; Q123N,T; L125S,M; V126H,S; N127Q,E; S128A,T;
F132I,L; S143N; I148V; S151A; S153D,Y; D154Q,S,G; I157V; A158D;
S159T; G160A,S; E161T,N; K162N; F163W; G170fH; ()171N; N173Q,S;
T172P; P184Q,S; E185S; S186A,E,P; E187A; T187aS; T190P,A;
G194S,T; V195L,T; K198A,N,V; E200G,V; A201D,E; S201d();
Q201e(),T; L201f(); preferably all three deletions; G202S,A;
D203R,K,S; E203aV,T; V204Q,E,S,A; A205E; L211V; A220L,N; H223D;
T228N; E232T; D233E; V235Y,L,T; V236Y,N; L237F; M238L; C242P,S;
T246V; Q251eE,Q; Q256D; H260A; K264R,I; K265Q,A; N267D;
Q270Y,A,L,G; S271D,N; G273D,K; Y275F; Y278T,H; A280P; A287T;
Q288L,I,F; F292Y; T293A,V; R302H; P304A; F332F; N336S;
S337T,G,Q; M338I; V339I; S340P,A; F343A,S,I,L; N349P; E352K;
S360R; K362P; Y364W,F; S365V,L,A; A366D,V,S; S367A,K; W368K;
V369I,L; G373S,A; R375H; A376M; K383kQ,E; D404A,G; K411T; I393V;
L412R; K417E,R; W421F,Y; G431E.
21. The phytase variant of claim 20, which is derived from an
Aspergillus phytase, preferably using a model phytase derived
from Aspergillus fumigatus.

22. The phytase variant of claim 21, the model phytase of
which is a phytase derived from either of the following strains
of Aspergillus fumigatus: ATCC 13073, ATCC 32722, ATCC 58128,
ATCC 26906 or ATCC 32239.

23. The phytase variant of claim 22, the model phytase of
which is the Aspergillus fumigatus phytase sequence of Fig. 8.

24. The phytase variant according to any of claims 1 or 2,
which phytase variant comprises at least one of the following
amendments:
G24C; V27P; H39S,Q; L40N; G42S;
Q43A,C,D,E,F,G,H,I,K,L,M,N,P,Q,R,S,T,V,W,Y; Y44N; S45D; Y47F;
S49P; E51A,R; V56P; D58K,A; D59G; V62I; S69Q; Y75W,F; S78D;
S79G; K80A; S81A,G,Q,E; K82T; A83I,Q,K,R; Y84Q,I,V; A90R;
D115N; D116S; T118V,L; F119E; P120L; E122A; N123Q,T; M125S;
V126H,S; N127Q,E; S128A,T; Y132F,I,L; K143N; I148V; S151A;
S153D,Y; D154Q,S,G; I157V; A158D; S159T; A160S; E161T,N; K162N;
F163W; G170fH; S170gA; Q171N; H173Q,S; P184Q,S; E185S;
G186A,E,P; S187A; G187aS; T190P,A; H193S; G194S,T; T195V,L;
A198N,V; E200G,V; D201E; S201d(); E201e(),T; L201f(); preferably
all three; G202S,A; D203R,K,S; D203aV,T; V204Q,S,A,E; L211V;
A215P; L220N; D223H; T228N; E232T; D233E; V235Y,L,T; Y236N;
L237F; M238L; P242S; E244D; E251eQ; A256D; H260A; R264I; Q265A;
Q270Y,A,L,G; S271D,N; G273D,K; Y275F; Y278T,H; A280P; A287T;
Q288L,I,F; F292Y; A293V; R302H; P304A; N336S; S337T,Q,G; M338I;
I339V; S340P,A; F343A,S,I,L; N349P; A352K; S360R; E362P;
Y364W,F; S365V,L,A; A366D,V,S; S367K,A; W368K; T369I,L; G373S,A;
A374S; R375H; A376M; Q383kE; A404G; K411T; E417R; F421Y; A431E.

25. The phytase variant of claim 24, the model phytase of
which is an ascomycete consensus phytase.

26. The phytase variant of claim 25, the model phytase of
which is the ascomycetes consensus sequence "conphys" of Fig. 9.

27. The phytase variant according to any of claims 1 or 2,
which phytase variant comprises at least one of the following
amendments:
V24C; F27P; ()31Y; F33C; D39H,S,Q; S40L,N; A42S,G;
A43C,D,E,F,G,H,I,K,L,M,N,P,Q,R,S,T,V,W,Y; Y44N; T45D,S; Y47F;
Q51E,A,R; K58D,A; K61R; I62V; F75W; S78D; A80K; G81A,Q,E;
R83A,I,Q,K; I84Y,Q,V; V88I; K90R,A; L102Y; D115N; D116S; V118L;
P119E; F120L; L123N,T,Q; S125M; S126H,V; Q127E,N; A128S,T;
T132F,I,L; E143N; V148I; S151A; S152G; S153D,Y; N154D,Q,S,G;
D158A; S159T; A160S; T161N; ()170fH; ()170gA; ()171N; H173Q,S;
H172P; S184Q,P; E185S; S186A,E,P; L187A; ()187aS; T190P,A;
D193S; A194S,T; M195T,V,L; N198A,V; G200V; S201D,E; ()201eT;
S202A; D203R,K,S; P203aV,T; Q204E,S,A,V; T205E; I211L,V; P215A;
L220N; Q223D,H; A232T; D233E; S235Y,L,T; N236Y; L237F; I238L,M;
A242P,S; E244D; I246V; ()251eE,Q; N256D; P260A,H; A264R,I;
Q265A; E267D; G270Y,A,L; L332F; D271N; D273K; F275Y; T278H;
Y280A,P; Y283P; V287A,T; Q288L,I,F; Y292F; I293A,V; E302R,H;
P304A; L332F; N336S; Q337T,S,G; M338I; I339V; A340P;
S343A,F,I,L; F348Y; N349P; S352K; P360R; R362P; W364F;
V365L,A,S; T366D,V,S; S367K,A; R368K; L369I; T370V; S373A;
A374S; R375H; S383kQ,E; T387P; A396R; G404A; L409R; T411K;
L412R; E417R; Y421F.

28. The phytase variant of claim 27, the model phytase of
which is a phytase derived from Agrocybe pediades.

29. The phytase variant of claim 27, the model phytase of
which is a phytase derived from Agrocybe pediades CBS 900.96.

30. The phytase variant of claim 29, the model phytase of
which is the Agrocybe pediades phytase sequence of Fig. 5.

31. The phytase variant according to any of claims 1-2, which
phytase variant comprises at least one of the following
amendments:
F24C; V27P; L31Y; I33C; S39H,Q; N40L; G42S;
P43A,C,D,E,F,G,H,I,K,L,M,N,Q,R,S,T,V,W,Y; Y44N; D45S; F47Y;
E51A,R; E58D,K,A; T61R; V62I; W75F; S78D; A80K; R81Q,E,G,A;
S82T; R83A,I,Q,K; Q84Y,V,I; V88I; K90R,A; A115N; D116S; L118V;
P119E; F120L; N123T,Q; S125M; H126S,V; Q127E,N; T128A,S;
M132F,I,L; G143N; V148I; A151S; D153Y; Q154D,S,G; D158A; S159T;
S160A; T161N; ()170fH; ()170gA; S171N; G172P; E173Q,S; Q184S,P;
E185S; E186A,P; G187A; ()187aS; T190P,A; N193S; N194S,T;
M195T,V,L; N198A,V; V200G; D201E; ()201eT; G202S,A; D203R,K,S;
()203aV,T; E204Q,S,A,V; S205E; V211L; N215A,P; L220N; A223D,H;
S232T; D233E; L235Y,T; T236Y,N; L237F; M238L; P242S; L246V;
()251eE,Q; A260H; V264R,I; S265Q,A; E267D; Y270A,L,G; D271N;
D273K; F275Y; G278T,H; P280A; A283P; T287A; Q288L,I,F; Y292F;
V293A; G302R,H; A304P; N336S; T337Q,S,G; M338I; V339I; P340A;
A343S,F,I,L; F348Y; N349P; A352K; E360R; R362P; W364F;
V365L,A,S; D366V,S; S367K,A; L369I; S373A; G374A,S; ()383kQ,E;
E387P; A396R; G404A; V409R; E411K,T; L412R; E417R; Y421F; A431E.

32. The phytase variant of claim 31, the model phytase of
which is a phytase derived from Peniophora lycii.

33. The phytase variant of claim 32, the model phytase of
which is a phytase derived from Peniophora lycii CBS 686.96.

34. The phytase variant of claim 33, the model phytase of
which is the Peniophora lycii phytase sequence of Fig. 6.

35. A phytase polypeptide which comprises a phytase variant 
according to any of the previous claims. 

36. A DNA construct comprising a DNA sequence encoding a 
phytase variant according to any one of claims 1-34. 

37. A recombinant expression vector which comprises a DNA 
construct according to claim 36. 

38. A host cell which is transformed with a DNA construct 
according to claim 36 or a vector according to claim 37. 

39. A process for preparing a phytase variant, the process 
comprising culturing the host cell according to claim 38 under 
conditions permitting the production of the phytase variant, and 
recovering the phytase from the culture broth. 

40. A feed or food comprising at least one phytase variant of 
any of claims 1-34. 

41. A process for preparing a feed or food according to claim 
40, wherein the at least one phytase variant is added to the 
food or feed components. 

42. A composition comprising at least one phytase variant of 
any of claims 1-34. 

43. The composition according to claim 42 suitable for use in 
food or feed preparations. 

44. The composition according to any of claims 42-43 which is 
an animal feed additive. 

45. A process for reducing phytate levels in animal manure 
comprising feeding an animal with an effective amount of the 
feed according to claim 40 or obtainable according to claim 41. 

46. Use of the phytase variant of any of claims 1-34; or the 
composition of any of claims 42-43 for liberating phosphorous 
from a phytase substrate. 

47. A transgenic plant or plant part which is capable of 
expressing a phytase variant according to any one of claims 1-34. 
Peniophora numbers  1 ... 37
Alignment numbers   1 ... 50

P_involtus_A1       ML FGFVALACLL  SLSEVLATSV P KNT APTFPIPESE
P_involtus_A2       MH LGFVTLACLI  HLSEVFAASV P RNI APKFSIPESE
T_pubescens         MAFSILASLL  FVCYAYARAV PRAHIPLRDT SACLDVTRDV
A_pediades          MSLFIGGCLL  VFLQASAYGG VVQATFVQPF FPPQI
P_lycii             MV SSAFAPSILL  SLMSSLALST QFSF V AAQLPIPAQN
A_fumigatus         MVTL TFLLSAAYLL  .SGRVSAAPS SAGSKSCDTV DLGYQCSSAT
consphyA            MGVF WLLSIATLF  GSTSGTALGP RGNSHSCDTV DGGYQCEPEI
A_nidulans          MAFP TVALSLYYLL  ..SRVSAQAP WQNHSCNTA DGGYQCTPNV
A_ficuum_NRRL3135   MGVS AVLLPLYLLS  GVTSGLAVPA SRNQSSCDTV DQGYQCTSET
A_terreus           MGFL AIVLSVALLF  RSTSGTPLGP RGKHSDCNSV DHGYQCEPEL
T_thermo            MSLL LLVLSGGLVA  LYVS...RNP HVDSHSCNTV EGGYQCHPEI
T_lanuginosa        MAGIGLGSFL VLLLQFSALL  TASPAIPPFW RKKHPNVD i
M_thermophila       MTGL GVMWMVGFL  AIASL QSESRPCDTP DLGFQCGEAI



Peniophora numbers  38 ... 83
Alignment numbers   51 ... 100

P_involtus_A1       QRNWSPYSPY  FPLAEYKA..  ..PPAGCQIN  QVNIIQRHGA  RFPTSGATTR
P_involtus_A2       QRNWSPYSPY  FPLAEYKA..  ..PPAGCEIN  QVNIIQRHGA  RFPTSGAATR
T_pubescens         QQSWSMYSPY  FPAATYVA..  ..PPASCQIN  QVHIIQRHGA  RFPTSGAAKR
A_pediades          QDSWAAYTPY  YPVQAYTP..  ..PPKDCKIT  QVNIIQRHGA  RFPTSGAGTR
P_lycii             TSNWGPYDPF  FPVEPYAA..  ..PPEGCTVT  QVNLIQRHGA  RWPTSGARSR
A_fumigatus         SHLWGQYSPF  FSLKDELSVS  SKLPKDCRIT  LVQVLSRHGA  RYPTSSKSKK
consphyA            SHLWGQYSPY  FSLEDESAIS  PDVPDDCRVT  FVQVLSRHGA  RYPTSSKSKA
A_nidulans          SHVWGQYSPY  FSIEQESAIS  EDVPHGCEVT  FVQVLSRHGA  RYPTESKSKA
A_ficuum_NRRL3135   SHLWGQYAPF  FSLANESVIS  PEVPAGCRVT  FAQVLSRHGA  RYPTDSKGKK
A_terreus           SHKWGLYAPY  FSLQDESPFP  LDVPEDCHIT  FVQVLARHGA  RSPTHSKTKA
T_thermo            SHSWGQYSPF  FSLADQSEIS  PDVPQNCKIT  FVQLLSRHGA  RYPTSSKTEL
T_lanuginosa        ARHWGQYSPF  FSLAEVSEIS  PAVPKGCRVE  FVQVLSRHGA  RYPTAHKSEV
M_thermophila       SHFWGQYSPY  FSVP..SELD  ASIPDDCEVT  FAQVLSRHGA  RAPTLKRAAS



Peniophora numbers  84 ... 133
Alignment numbers   101 ... 150

P_involtus_A1       IKAGLTKLQG  VQNFTDAKFN  FIKSFKYDLG  NSDLVPFGAA  QSFDAGQEAF
P_involtus_A2       IKAGLSKLQS  VQNFTDPKFD  FIKSFTYDLG  TSDLVPFGAA  QSFDAGLEVF
T_pubescens         IQTAVAKLKA  ASNYTDPLLA  FVTNYTYSLG  QDSLVELGAT  QSSEAGQEAF
A_pediades          IQAAVKKLQS  AKTYTDPRLD  FLTNYTYTLG  HDDLVPFGAL  QSSQAGEETF
P_lycii             QVAAVAKIQM  ARPFTDPKYE  FLNDFVYKFG  VADLLPFGAN  QSHQTGTDMY
A_fumigatus         YKKLVTAIQA  NATDFKGKFA  FLKTYNYTLG  ADDLTPFGEQ  QLVNSGHCFY
consphyA            YSALIEAIQK  NATAFKGKYA  FLKTYNYTLG  ADDLTPFGEN  QMVNSGIKFY
A_nidulans          YSGLIEAIQK  NATSFWGQYA  FLESYNYTLG  ADDLTIFGEN  QMVDSGAKFY
A_ficuum_NRRL3135   YSALIEEIQQ  NATTFDGKYA  FLKTYNYSLG  ADDLTPFGEQ  ELVNSGIKFY
A_terreus           YAATIAAIQK  SATAFPGKYA  FLQSYNYSLD  SEELTPFGRN  QLRDLGAQFY
T_thermo            YSQLISRIQK  TATAYKGYYA  FLKDYRYQLG  ANDLTPFGEN  QMIQLGIKFY
T_lanuginosa        YAELLQHIQD  TATEFKGDFA  FLRDYAYHLG  ADNLTRFGEE  QMMESGRQFY
M_thermophila       YVDLIDRIHH  GAISYGPGYE  FLRTYDYTLG  ADELTRTGQQ  QMVNSGIKFY



Peniophora numbers  ... 176
Alignment numbers   151 ... 200

P_involtus_A1       ARYSKLVSKN  NLPFIRADGS  DRVVDSATNW  TAGFASA...  ....SHNTVQ
P_involtus_A2       ARYSKLVSSD  NLPFIRSDGS  DRVVDTATNW  TAGFASA...  ....SRNAIQ
T_pubescens         TRYSSLVSAD  ELPFVRASGS  DRVVATANNW  TAGFALA...  ....SSNSIT
A_pediades          QRYSFLVSKE  NLPFVRASSS  NRVVDSATNW  TEGFSAA...  ....SHHVLN
P_lycii             TRYSTLFEGG  DVPFVRAAGD  QRWDSSTNW  TAGFGDA...  ....SGETVL
A_fumigatus         QRYKAL.ARS  WPFIRASGS  DRVIASGEKF  IEGFQQAKLA  DPGA.TNRAA
consphyA            RRYKAL.ARK  IVPFIRASGS  DRVIASAEKF  IEGFQSAKLA  DPGSQPHQAS
A_nidulans          RRYKNL.ARK  NTPFIRASGS  DRWASAEKF  INGFRKAQLH  DHGS..KRAT
A_ficuum_NRRL3135   QRYESL.TRN  IVPFIRSSGS  SRVIASGKKF  IEGFQSTKLK  DPRAQPGQSS
A_terreus           ERYNAL.TRH  INPFVRATDA  SRVHESAEKF  VEGFQTARQD  DHHANPHQPS
T_thermo            NHYKSL.ARN  AVPFVRCSGS  DRVIASGRLF  IEGFQSAKVL  DPHSDKHDAP
T_lanuginosa        HRYREQ.ARE  IVPFVRAAGS  ARVIASAEFF  NRGFQDAKDR  DPRSNKDQAE
M_thermophila       RRYRAL.ARK  SIPFVRTAGQ  DRWHSAENF  TQGFHSALLA  DRGSTVRPTL



Peniophora numbers  177 ... 217
Alignment numbers   201 ... 250

P_involtus_A1       PKLNLILPQT  G..NDTLEDN  MCPAAGD.  .SDPQVNA  WLAVAFPSIT
P_involtus_A2       PKLDLILPQT  G..NDTLEDN  MCPAAGE.  .SDPQVDA  WLASAFPSVT
T_pubescens         PVLSVIISEA  G..NDTLDDN  MCPAAGD.  .SDPQVNQ  WLAQFAPPMT
A_pediades          PILFVILSES  L..NDTLDDA  MCPNAGS.  .SDPQTGI  WTSIYGTPIA
P_lycii             PTLQWLQEE  G..NCTLCNN  MCPNEVD.  .GD.ESTT  WLGVFAPNIT
A_fumigatus         PAISVIIPES  ETFNNTLDHG  VCTKFEA.  SQLGDEVAAN  FTALFAPDIR
consphyA            PVIDVIIPEG  SGYNNTLDHG  TCTAFED.  SELGDDVEAN  FTALFAPAIR
A_nidulans          PWNVTIFEI  DGFNNTLDHS  TCVSFEN.  DERADEIEAN  FTAIMGPPIR
A_ficuum_NRRL3135   PKIDWISEA  SSSNNTLDPG  TCTVFED.  SELADTVEAN  FTATFVPSIR
A_terreus           PRVDVAIPEG  SAYNNTLEHS  LCTAFES.  STVGDDAVAN  FTAVFAPAIA
T_thermo            PTINVTIEEG  PSYNNTLDTG  SCPVFED.  SSGGHDAQEK  FAKQFAPAIL
T_lanuginosa        FVINVTISEE  TGSNNTLDGL  TCPAAEE.  AP.DPTQPAE  FLQVFGPRVL
M_thermophila       PYDMWIPET  AGANNTLHND  LCTAFEEGPY  STIGDDAQDT  YLSTFAGPIT



Peniophora numbers  218 ... 252
Alignment numbers   251 ... 300

P_involtus_A1       ARLNAAAPSV  NLTDTDAFNL  VSLCAFLTVS
P_involtus_A2       AQLNAAAPGA  NLTDADAFNL  VSLCPFMTVS
T_pubescens         ARLNAGAPGA  NLTDTDTYNL  LTLCPFETVA
A_pediades          NRLNQQAPGA  NITAADVSNL  IPLCAFETIV
P_lycii             ARLNAAAPSA  KLSDSDALTL  MDMCPFDTLS
A_fumigatus         ARAEKHLPGV  TLTDEDWSL  MDMCSFDTVA  RTSD..ASQ.
consphyA            ARLEADLPGV  TLTDEDWYL  MDMCPFETVA  RTSD..ATE.
A_nidulans          KRLENDLPGI  KLTNENVIYL  MDMCSFDTMA  RTAH..GTE.
A_ficuum_NRRL3135   QRLENDLSGV  TLTDTEVTYL  MDMCSFDTIS  TSTV..DTK.  LS
A_terreus           QRLEADLPGV  QLSTDDWNL  MAMCPFETVS  LTDD..AHT..  LS
T_thermo            EKIKDHLPGV  DLAVSDVPYL  MDLCPFETItA  RNHT..DT..  ........LS
T_lanuginosa        KKITKHMPGV  NLTLEDVPLF  MDLCPFDTVG  SDPVLFPRQ.  LS
M_thermophila       ARVNANLPGA  NLTDADTVAL  MDLCPFETVA  SSSSDPATAD  AGGGNGRPLS

Peniophora numbers  253 ... 300
Alignment numbers   301 ... 350

P_involtus_A1       DFCTLFEGIP  GSFEAFAYGG  DLDKFYGTGY  GQELGPVQGV  GYVNELIARL
P_involtus_A2       DFCTLFEGIP  GSFEAFAYAG  DLDKFYGTGY  GQALGPVQGV  GYINELLARL
T_pubescens         EFCDIYEELQ  AE.DAFAYNA  DLDKFYGTGY  GQPLGPVQGV  GYINELIARL
A_pediades          PFCNLFT..P  EEFAQFEYFG  DLDKFYGTGY  GQPLGPVQGV  GYINELLARL
P_lycii             PFCDLFT..A  EEYVSYEYYY  DLDKYYGTGP  GNALGPVQGV  GYVNELLARL
A_fumigatus         PFCQLFT..H  NEWKKYNYLQ  SLGKYYGYGA  GNPLGPAQGI  GFTNELIARL
consphyA            PFCALFT..H  DEWRQYDYLQ  SLGKYYGYGA  GNPLGPAQGV  GFANELIARL
A_nidulans          PFCAIFT..E  KEWLQYDYLQ  SLSKYYGYGA  GSPLGPAQGI  GFTNELIARL
A_ficuum_NRRL3135   PFCDLFT..H  DEWINYDYLQ  SLKKYYGHGA  GNPLGPTQGV  GYANELIARL
A_terreus           PFCDLFT..A  TEWTQYNYLL  SLDKYYGYGG  GNPLGPVQGV  GWANELMARL
T_thermo            PFCALST..Q  EEWQAYDYYQ  SLGKYYSNGG  GNPLGPAQGV  GFVNELIARM
T_lanuginosa        PFCHLFT..A  DDWMAYDYYY  TLDKYYSHGG  GSAFGPSRGV  GFVNELIARM
M_thermophila       PFCRLFS..E  SEWRAYDYLQ  SVGKWYGYGP  GNPLGPTQGV  GFVNELLARL

Peniophora numbers  301 ... 349
Alignment numbers   351 ... 400

P_involtus_A1       TNS.AVRDNT  QTNRTLDASP  VTFPLNKTFY  ADFSHDNLMV  AVFSAMGLFR
P_involtus_A2       TNS.AVNDNT  QTNRTLDAAP  DTFPLNKTMY  ADFSHDNLMV  AVFSAMGLFR
T_pubescens         TAQ.NVSDHT  QTNSTLDSSP  ETFPLNRTLY  ADFSHDNQMV  AIFSAMGLFN
A_pediades          TEM.PVRDNT  QTNRTLDSSP  LTFPLDRSIY  ADLSHDNQMI  AIFSAMGLFN
P_lycii             TGQ.AVRDET  QTNRTLDSDP  ATFPLNRTFY  ADFSHDNTMV  PIFAALGLFN
A_fumigatus         TRS.PVQDHT  STNSTLVSNP  ATFPLNATMY  VDFSHDNSMV  SIFFALGLYN
consphyA            TRS.PVQDHT  STNHTLDSNP  ATFPLNATLY  ADFSHDNSMI  SIFFALGLYN
A_nidulans          TQS.PVQDNT  STNHTLDSNP  ATFPLDRKLY  ADFSHDNSMI  SIFFAMGLYN
A_ficuum_NRRL3135   THS.PVHDDT  SSNHTLDSSP  ATFPLKSTLY  ADFSHDNGII  SILFALGLYN
A_terreus           TRA.PVHDHT  CVNNTLDASP  ATPPLNATLY  ADFSHDSNLV  SIFWALGLYN
T_thermo            THS.PVQDYT  TVNHTLDSNP  ATFPLNATLY  ADFSEDNTMT  SIFAALGLYN
T_lanuginosa        TGNLPVKDHT  TVNHTLDDNP  ETFPLDAVLY  ADFSHDNTMT  GIFSAMGLYN
M_thermophila       A.GVPVRDGT  STMRTLDGDP  RTFPLGRPLY  ADFSHDNDMM  GVLGALGAYD



Peniophora numbers  350 ... 383
Alignment numbers   401 ... 450

P_involtus_A1       QPAPLSTSVP  NPWR T WRTSSLVPFS  GRMVVERLSC
P_involtus_A2       QSAPLSTSTP  DPNR . . T WLTSSVVPFS  ARMAVERLSC
T_pubescens         QSAPLDPTTP  DPAR T FLVKKIVPFS  ARMVVERLDC
A_pediades          QSSPLDPSFP  NPKR . T WVTSRLTPFS  ARMVTERLLC QRDGTGSGGP
P_lycii             ATA.LDPLKP  DENR L WVDSKLVPFS  GHMTVEKLAC
A_fumigatus         GTEPLSRTSV  ESAKE..LDG YSASWWPFG  ARAYFETMQC
consphyA            GTAPLSTTSV  ESIEE..TDG YSASWTVPFG  ARAYVEMMQC
A_nidulans          GTQPLSMDSV  ESXQE..MDG YAASWTVPFG  ARAYFELMQC
A_ficuum_NRRL3135   GTKPLSTTTV  ENITQ..TDG FSSAWTVPFA  SRLYVEMMQC
A_terreus           GTAPLSQTSV  ESVSQ..TDG YAAAWTVPFA  ARAYVEMMQC
T_thermo            GTAKLSTTEI  KSIEE..TDG YSAAMTTVPFG  GRAYIEMMQC
T_lanuginosa        GTKPLSTSKI  QPPTGAAADG YAASWTVPFA  ARAYVELLRC ETETSSEEEE
M_thermophila       GVPPLDKTAR  RDPEE..LGG YAASWAVPFA  ARIYVEKMRC SGGGGGGGGG



Peniophora numbers  384 ... 425
Alignment numbers   451 ... 500

P_involtus_A1       .......FGT  TKVRVLVQDQ  VQPLEFCGGD  RNGLCTLAKF  VESQTFARSD
P_involtus_A2       .......AGT  TKVRVLVQDQ  VQPLEFCGGD  QDGLCALDKF  VESQAYARSG
T_pubescens         .......GGA  QSVRLLVNDA  VQPLAFCGAD  TSGVCTLDAF  VESQAYARND
A_pediades          SRIMRNGNVQ  TFVRILVNDA  LQPLKFCGGD  MDSLCTLEAF  VESQKYARED
P_lycii             .......SGK  EAVRVLVNDA  VQPLEFCGG.  VDGVCELSAF  VESQTYAREN
A_fumigatus         K..S...EKE  PLVRALINDR  WPLHGCDVD  KLGRCKLNDF  VKGLSWARSG
consphyA            Q..A...EKE  PLVRVLVNDR  WPLHGCAVD  KLGRCKRDDF  VEGLSFARSG
A_nidulans          E KKE  PLVRVLVNDR  WPLHGCAVD  KFGRCTLDDW  VEGLNFARSG
A_ficuum_NRRL3135   Q..A...EQA  PLVRVLVNDR  WPLHGCPVD  ALGRCTRDSF  VRGLSFARSG
A_terreus           R..A...EKE  PLVRVLVNDR  VMPLHGCPTD  KLGRCKRDAF  VAGLSFAQAG
T_thermo            D..b...SDE  PWRVLVNDR  WPLHGCEVD  SLGRCKRDDF  VRGLSFARQG
T_lanuginosa        E..G...EDE  PFVRVLVNDR  WPLHGCRVD  RWGRCRRDEW  IKGLTFARQG
M_thermophila       E..GRQEKDE  EMVRVLVNDR  VMTLKGCGAD  ERGMCTLERF  IESMAFARGN



Peniophora numbers  426 ... 439
Alignment numbers   501 ... 514

P_involtus_A1       GAGDFEKCFA TSA
P_involtus_A2       GAGDFEKCLA TTV
T_pubescens         GEGDFEKCFA T..
A_pediades          GQGDFEKCFD ...
P_lycii             GQGDFAKCGF VPSE
A_fumigatus         ..GITWGECFS ...
consphyA            ..GNWAECFA ...
A_nidulans          ..GNWKTCFT L..
A_ficuum_NRRL3135   ..GDWAECFA ...
A_terreus           ..GNWADCF. ...
T_thermo            ..GNWEGCYA ASE
T_lanuginosa        ..GHWDRCF. ...
M_thermophila       ..GKWDLCFA ...
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 
    (A) LENGTH: 1522 base pairs 
    (B) TYPE: nucleic acid 
    (C) STRANDEDNESS: single 
    (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 
    (A) ORGANISM: Paxillus involutus 
    (B) STRAIN: CBS 100231 

(ix) FEATURE: 
    (A) NAME/KEY: CDS 
    (B) LOCATION: 58..1383 

(ix) FEATURE: 
    (A) NAME/KEY: mat_peptide 
    (B) LOCATION: 115..1383 

(ix) FEATURE: 
    (A) NAME/KEY: sig_peptide 
    (B) LOCATION: 58..114 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

GGATCCGAAT TCGGCACTCG TACGGTCCCC CGGTCTACCC TCTGCTCGCC TTGGAAG 57 

ATG CTC TTC GGT TTC GTC GCC CTC GCC TGT CTC TTG TCC CTC TCC GAG 105 
Met Leu Phe Gly Phe Val Ala Leu Ala Cys Leu Leu Ser Leu Ser Glu 
-15 -10 -5 

GTC CTT GCG ACC TCC GTG CCC AAG AAC ACA GCG CCG ACC TTC CCC ATT 153 
Val Leu Ala Thr Ser Val Pro Lys Asn Thr Ala Pro Thr Phe Pro Ile 
1 5 10 

CCG GAG AGT GAG CAG CGG AAC TGG TCC CCG TAC TCG CCC TAC TTC CCT 201 
Pro Glu Ser Glu Gln Arg Asn Trp Ser Pro Tyr Ser Pro Tyr Phe Pro 
15 20 25 

CTT GCC GAG TAC AAG GCT CCT CCG GCG GGC TGC CAG ATC AAC CAG GTC 249 
Leu Ala Glu Tyr Lys Ala Pro Pro Ala Gly Cys Gln Ile Asn Gln Val 
30 35 40 45 

AAC ATC ATC CAA AGA CAT GGT GCC CGG TTC CCG ACC TCT GGC GCG ACC 297 
Asn Ile Ile Gln Arg His Gly Ala Arg Phe Pro Thr Ser Gly Ala Thr 
50 55 60 

ACC CGT ATC AAG GCG GGT TTG ACC AAG TTG CAA GGC GTC CAG AAC TTT 345 
Thr Arg Ile Lys Ala Gly Leu Thr Lys Leu Gln Gly Val Gln Asn Phe 
65 70 75 

ACC GAC GCC AAA TTC AAC TTC ATC AAG TCG TTC AAG TAC GAT CTC GGT 393 
Thr Asp Ala Lys Phe Asn Phe Ile Lys Ser Phe Lys Tyr Asp Leu Gly 
80 85 90 

AAC TCG GAC CTC GTT CCG TTC GGT GCA GCA CAG TCC TTC GAC GCT GGT 441 
Asn Ser Asp Leu Val Pro Phe Gly Ala Ala Gln Ser Phe Asp Ala Gly 
95 100 105 
CAG GAG GCC TTC GCC CGC TAC TCG AAG CTT GTC AGC AAG AAC AAC CTG 489 
Gln Glu Ala Phe Ala Arg Tyr Ser Lys Leu Val Ser Lys Asn Asn Leu 
110 115 120 125 

CCG TTC ATT CGT GCC GAT GGA AGT GAT CGT GTT GTG GAT TCT GCT ACA 537 
Pro Phe Ile Arg Ala Asp Gly Ser Asp Arg Val Val Asp Ser Ala Thr 
130 135 140 

AAC TGG ACT GCG GGT TTC GCT TCG GCA AGT CAC AAC ACG GTC CAG CCC 585 
Asn Trp Thr Ala Gly Phe Ala Ser Ala Ser His Asn Thr Val Gln Pro 
145 150 155 

AAG CTG AAC CTG ATT CTC CCG CAA ACT GGC AAT GAT ACC CTG GAA GAT 633 
Lys Leu Asn Leu Ile Leu Pro Gln Thr Gly Asn Asp Thr Leu Glu Asp 
160 165 170 

AAT ATG TGC CCT GCT GCT GGC GAT TCT GAC CCC CAG GTC AAC GCG TGG 681 
Asn Met Cys Pro Ala Ala Gly Asp Ser Asp Pro Gln Val Asn Ala Trp 
175 180 185 

TTG GCT GTT GCT TTC CCT TCC ATC ACT GCA CGG CTC AAC GCC GCC GCG 729 
Leu Ala Val Ala Phe Pro Ser Ile Thr Ala Arg Leu Asn Ala Ala Ala 
190 195 200 205 

CCC TCT GTC AAC CTC ACC GAC ACG GAC GCG TTC AAC CTC GTC AGT CTC 777 
Pro Ser Val Asn Leu Thr Asp Thr Asp Ala Phe Asn Leu Val Ser Leu 
210 215 220 

TGC GCT TTC TTG ACA GTC TCG AAG GAG AAG AAG AGT GAC TTC TGC ACC 825 
Cys Ala Phe Leu Thr Val Ser Lys Glu Lys Lys Ser Asp Phe Cys Thr 
225 230 235 

CTG TTC GAG GGC ATC CCT GGC TCT TTC GAG GCG TTC GCC TAT GGT GGC 873 
Leu Phe Glu Gly Ile Pro Gly Ser Phe Glu Ala Phe Ala Tyr Gly Gly 
240 245 250 

GAC CTT GAC AAG TTC TAC GGT ACC GGT TAC GGT CAG GAA CTC GGA CCC 921 
Asp Leu Asp Lys Phe Tyr Gly Thr Gly Tyr Gly Gln Glu Leu Gly Pro 
255 260 265 

GTT CAA GGC GTC GGC TAC GTC AAC GAG CTC ATC GCC CGC CTC ACC AAC 969 
Val Gln Gly Val Gly Tyr Val Asn Glu Leu Ile Ala Arg Leu Thr Asn 
270 275 280 285 

TCC GCC GTC CGC GAC AAC ACC CAG ACG AAC CGC ACA CTC GAC GCC TCG 1017 
Ser Ala Val Arg Asp Asn Thr Gln Thr Asn Arg Thr Leu Asp Ala Ser 
290 295 300 

CCC GTA ACC TTC CCG TTG AAC AAG ACG TTC TAC GCC GAT TTC TCC CAC 1065 
Pro Val Thr Phe Pro Leu Asn Lys Thr Phe Tyr Ala Asp Phe Ser His 
305 310 315 

GAC AAC CTC ATG GTC GCC GTC TTC TCC GCC ATG GGC CTC TTC CGC CAG 1113 
Asp Asn Leu Met Val Ala Val Phe Ser Ala Met Gly Leu Phe Arg Gln 
320 325 330 

CCC GCG CCG CTC AGC ACG TCC GTG CCG AAC CCA TGG CGC ACG TGG CGC 1161 
Pro Ala Pro Leu Ser Thr Ser Val Pro Asn Pro Trp Arg Thr Trp Arg 
335 340 345 
ACG AGC TCC CTC GTC CCC TTC TCC GGA CGC ATG GTC GTG GAA CGC CTC 1209 
Thr Ser Ser Leu Val Pro Phe Ser Gly Arg Met Val Val Glu Arg Leu 
350 355 360 365 

AGC TGT TTC GGC ACG ACC AAG GTT CGC GTC CTC GTG CAG GAC CAG GTG 1257 
Ser Cys Phe Gly Thr Thr Lys Val Arg Val Leu Val Gln Asp Gln Val 
370 375 380 

CAG CCG CTC GAG TTC TGC GGG GGT GAT AGG AAC GGG CTG TGC ACG CTT 1305 
Gln Pro Leu Glu Phe Cys Gly Gly Asp Arg Asn Gly Leu Cys Thr Leu 
385 390 395 

GCT AAG TTT GTG GAG AGC CAG ACG TTT GCG AGG AGT GAT GGT GCG GGG 1353 
Ala Lys Phe Val Glu Ser Gln Thr Phe Ala Arg Ser Asp Gly Ala Gly 
400 405 410 

GAC TTT GAG AAG TGC TTC GCG ACC TCG GCG TGAGGATGGA CGAACAAAAT 1403 
Asp Phe Glu Lys Cys Phe Ala Thr Ser Ala 
415 420 

TAAATTGGGG TATTTTATCG TATAATTATG GTGTGTGTAG AACATGGGCT CGGGGTCGAT 1463 
GGTGAAAAGC AAAGGTTTAT CGTCTAAAAA AAAAAAAAAA AAAAAATTCC TGCGGCCGC 1522 
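
The feature locations given for SEQ ID NO: 25 (CDS 58..1383, sig_peptide 58..114, mat_peptide 115..1383) can be cross-checked with simple arithmetic. The following Python sketch is an illustration only; it assumes that locations are 1-based and inclusive, and the helper names `span_length` and `codon_count` are hypothetical.

```python
def span_length(start, end):
    """Length in nucleotides of an inclusive, 1-based location start..end."""
    return end - start + 1


def codon_count(start, end):
    """Number of codons covered by a location; fails if it is not a multiple of 3."""
    length = span_length(start, end)
    if length % 3 != 0:
        raise ValueError(f"{start}..{end} is not a whole number of codons")
    return length // 3


# Feature locations as listed for SEQ ID NO: 25.
FEATURES = {
    "CDS": (58, 1383),
    "sig_peptide": (58, 114),
    "mat_peptide": (115, 1383),
}

CODONS = {name: codon_count(*location) for name, location in FEATURES.items()}

if __name__ == "__main__":
    for name, count in CODONS.items():
        print(f"{name}: {count} codons")
```

Under these assumptions the signal peptide spans 19 codons and the mature peptide 423 codons, which together account for the 442 codons of the CDS.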
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 
    (A) LENGTH: 1642 base pairs 
    (B) TYPE: nucleic acid 
    (C) STRANDEDNESS: single 
    (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 
    (A) ORGANISM: Paxillus involutus 
    (B) STRAIN: CBS 100231 

(ix) FEATURE: 
    (A) NAME/KEY: CDS 
    (B) LOCATION: 48..1373 

(ix) FEATURE: 
    (A) NAME/KEY: mat_peptide 
    (B) LOCATION: 105..1373 

(ix) FEATURE: 
    (A) NAME/KEY: sig_peptide 
    (B) LOCATION: 48..104 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

GGATCCGAAT TCCAGTCCCC AAGCTAATCC TCTGCTCGCC TTGGAAG ATG CAC CTC 56 
                                                    Met His Leu 
                                                    -19 

GGC TTC GTC ACC CTC GCT TGT CTC ATA CAC CTC TCC GAG GTC TTC GCG 104 
Gly Phe Val Thr Leu Ala Cys Leu Ile His Leu Ser Glu Val Phe Ala 
-15 -10 -5 

GCA TCC GTG CCC CGG AAT ATT GCT CCG AAG TTC TCA ATT CCG GAA AGC 152 
Ala Ser Val Pro Arg Asn Ile Ala Pro Lys Phe Ser Ile Pro Glu Ser 
1 5 10 15 

GAG CAG CGA AAC TGG TCG CCT TAC TCT CCT TAC TTT CCC CTA GCC GAA 200 
Glu Gln Arg Asn Trp Ser Pro Tyr Ser Pro Tyr Phe Pro Leu Ala Glu 
20 25 30 

TAC AAG GCT CCT CCA GCA GGC TGC GAG ATT AAC CAA GTC AAT ATT ATC 248 
Tyr Lys Ala Pro Pro Ala Gly Cys Glu Ile Asn Gln Val Asn Ile Ile 
35 40 45 

CAA CGG CAT GGC GCA CGG TTC CCA ACC TCG GGT GCG GCC ACT CGC ATC 296 
Gln Arg His Gly Ala Arg Phe Pro Thr Ser Gly Ala Ala Thr Arg Ile 
50 55 60 

AAG GCT GGT TTA AGC AAG CTG CAA TCC GTC CAG AAT TTC ACC GAC CCC 344 
Lys Ala Gly Leu Ser Lys Leu Gln Ser Val Gln Asn Phe Thr Asp Pro 
65 70 75 80 

AAA TTC GAC TTC ATC AAG TCG TTC ACA TAC GAT CTT GGT ACT TCC GAC 392 
Lys Phe Asp Phe Ile Lys Ser Phe Thr Tyr Asp Leu Gly Thr Ser Asp 
85 90 95 



WO 99/49022 PCT/DK99/00153 

9/51 

CTC GTG CCA TTC GGC GCA GCA CAA TCA TTC GAT GCC GGC CTG GAG GTC 440 
Leu Val Pro Phe Gly Ala Ala Gln Ser Phe Asp Ala Gly Leu Glu Val 
100 105 110 

TTC GCT CGC TAT TCG AAG CTC GTC AGC TCG GAC AAC CTG CCT TTC ATT 488 
Phe Ala Arg Tyr Ser Lys Leu Val Ser Ser Asp Asn Leu Pro Phe Ile 
115 120 125 

CGC TCA GAT GGT AGC GAT CGT GTA GTC GAC ACT GCT ACG AAC TGG ACT 536 
Arg Ser Asp Gly Ser Asp Arg Val Val Asp Thr Ala Thr Asn Trp Thr 
130 135 140 

GCA GGT TTT GCT TCC GCG AGC CGC AAC GCG ATC CAA CCC AAG CTC GAC 584 
Ala Gly Phe Ala Ser Ala Ser Arg Asn Ala Ile Gln Pro Lys Leu Asp 
145 150 155 160 

TTG ATA CTT CCA CAA ACT GGC AAT GAC ACC CTC GAG GAC AAC ATG TGT 632 
Leu Ile Leu Pro Gln Thr Gly Asn Asp Thr Leu Glu Asp Asn Met Cys 
165 170 175 

CCA GCT GCT GGC GAA TCC GAC CCT CAG GTC GAT GCG TGG TTG GCG TCC 680 
Pro Ala Ala Gly Glu Ser Asp Pro Gln Val Asp Ala Trp Leu Ala Ser 
180 185 190 

GCC TTC CCA TCT GTC ACC GCG CAG CTC AAC GCT GCA GCG CCT GGT GCC 728 
Ala Phe Pro Ser Val Thr Ala Gln Leu Asn Ala Ala Ala Pro Gly Ala 
195 200 205 

AAT CTC ACA GAC GCC GAC GCC TTC AAC CTC GTC AGC CTG TGT CCC TTC 776 
Asn Leu Thr Asp Ala Asp Ala Phe Asn Leu Val Ser Leu Cys Pro Phe 
210 215 220 

ATG ACA GTT TCG AAG GAG CAG AAG AGC GAC TTC TGC ACG TTG TTC GAG 824 
Met Thr Val Ser Lys Glu Gln Lys Ser Asp Phe Cys Thr Leu Phe Glu 
225 230 235 240 

GGA ATC CCT GGA TCG TTC GAG GCG TTT GCC TAT GCC GGC GAC CTT GAC 872 
Gly Ile Pro Gly Ser Phe Glu Ala Phe Ala Tyr Ala Gly Asp Leu Asp 
245 250 255 

AAG TTC TAT GGG ACC GGC TAT GGC CAA GCC CTC GGA CCG GTC CAA GGC 920 
Lys Phe Tyr Gly Thr Gly Tyr Gly Gln Ala Leu Gly Pro Val Gln Gly 
260 265 270 

GTC GGC TAC ATC AAC GAG CTC CTT GCA CGC CTG ACC AAC TCC GCA GTG 968 
Val Gly Tyr Ile Asn Glu Leu Leu Ala Arg Leu Thr Asn Ser Ala Val 
275 280 285 

AAC GAC AAC ACA CAG ACG AAC CGC ACA CTC GAC GCC GCA CCA GAC ACG 1016 
Asn Asp Asn Thr Gln Thr Asn Arg Thr Leu Asp Ala Ala Pro Asp Thr 
290 295 300 

TTC CCG CTC AAC AAG ACC ATG TAC GCC GAT TTC TCA CAC GAC AAC CTC 1064 
Phe Pro Leu Asn Lys Thr Met Tyr Ala Asp Phe Ser His Asp Asn Leu 
305 310 315 320 

ATG GTC GCC GTG TTC TCC GCC ATG GGC CTC TTC CGC CAA TCC GCA CCG 1112 
Met Val Ala Val Phe Ser Ala Met Gly Leu Phe Arg Gln Ser Ala Pro 
325 330 335 



CTC AGC ACG TCC ACA CCG GAT CCG AAC CGC ACG TGG CTC ACG AGC TCT 1160 
Leu Ser Thr Ser Thr Pro Asp Pro Asn Arg Thr Trp Leu Thr Ser Ser 
340 345 350 

GTC GTT CCG TTC TCC GCG CGC ATG GCC GTG GAA CGC CTC AGC TGT GCT 1208 
Val Val Pro Phe Ser Ala Arg Met Ala Val Glu Arg Leu Ser Cys Ala 
355 360 365 

GGT ACC ACG AAG GTG CGC GTC CTG GTG CAG GAC CAG GTC CAG CCA CTC 1256 
Gly Thr Thr Lys Val Arg Val Leu Val Gln Asp Gln Val Gln Pro Leu 
370 375 380 

GAG TTC TGC GGC GGC GAC CAG GAT GGG TTG TGC GCG CTA GAC AAG TTC 1304 
Glu Phe Cys Gly Gly Asp Gln Asp Gly Leu Cys Ala Leu Asp Lys Phe 
385 390 395 400 

GTC GAG AGC CAG GCG TAT GCA CGG AGT GGT GGC GCA GGT GAC TTT GAG 1352 
Val Glu Ser Gln Ala Tyr Ala Arg Ser Gly Gly Ala Gly Asp Phe Glu 
405 410 415 

AAG TGT CTT GCG ACG ACG GTG TGAGATGGGG TAATCTACGG TGAAGCAGCG 1403 
Lys Cys Leu Ala Thr Thr Val 
420 

GAGAGCCTCT CAACGAATGC AAAGGATAGG TTCGAGGCTT ACTTCATCAA CCTATATCAT 1463 
CATAGGACAA GCCCCCCAAT AGCCAGACTC GTCGTTTGAC ATCGTGTATG AAAATAACCC 1523 
ACCCACGCAC TCCGCTGCCA CTATTCGCGT GTATCGCATA CTAGGCGTTT TCGCCCAGTT 1583 
GAACATGAGC CCATTCTGTC CCCAGTGAAA AAAAAAAAAA AAAAAATTCC TGCGGCCGC 1642 
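
The amino acid line printed under each row of codons can be checked against the nucleotide line by translating with the standard genetic code. The following Python sketch is an illustration only. It assumes the standard code applies to these fungal cDNAs, builds the codon table from the conventional TCAG ordering, and translates the 19 signal-peptide codons of SEQ ID NO: 27 (positions 48..104); the names `CODON_TABLE`, `translate` and `SIGNAL_SEQ27` are hypothetical.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering; "*" is a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string (whitespace ignored) into one-letter residues."""
    sequence = "".join(dna.split()).upper()
    usable = len(sequence) - len(sequence) % 3  # drop an incomplete final codon
    return "".join(CODON_TABLE[sequence[i:i + 3]] for i in range(0, usable, 3))


# Signal-peptide codons of SEQ ID NO: 27 as listed above.
SIGNAL_SEQ27 = (
    "ATG CAC CTC "
    "GGC TTC GTC ACC CTC GCT TGT CTC ATA CAC CTC TCC GAG GTC TTC GCG"
)

if __name__ == "__main__":
    print(translate(SIGNAL_SEQ27))
```

A codon that translates to a stop inside the coding region, or a residue that disagrees with the printed three-letter code, points to a transcription error in one of the two lines.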
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 
    (A) LENGTH: 1536 base pairs 
    (B) TYPE: nucleic acid 
    (C) STRANDEDNESS: single 
    (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 
    (A) ORGANISM: Trametes pubescens 
    (B) STRAIN: CBS 100232 

(ix) FEATURE: 
    (A) NAME/KEY: CDS 
    (B) LOCATION: 79..1407 

(ix) FEATURE: 
    (A) NAME/KEY: mat_peptide 
    (B) LOCATION: 130..1407 

(ix) FEATURE: 
    (A) NAME/KEY: sig_peptide 
    (B) LOCATION: 79..129 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

GGATCCGAAT TCGCCCCCAC ATTCGTTCCA TCTTAGCAGC CGTCCGCGCC CAGGTCTTCG 60 

ATAACCCCCC GCGTGACT ATG GCC TTC TCA ATC TTG GCC TCG CTG CTC TTC 111 
                    Met Ala Phe Ser Ile Leu Ala Ser Leu Leu Phe 
                    -17 -15 -10 

GTG TGT TAT GCA TAC GCC AGG GCT GTG CCC CGT GCA CAT ATC CCG CTC 159 
Val Cys Tyr Ala Tyr Ala Arg Ala Val Pro Arg Ala His Ile Pro Leu 
-5 1 5 10 

CGC GAC ACC TCC GCG TGT CTA GAT GTA ACA CGC GAT GTG CAG CAG AGC 207 
Arg Asp Thr Ser Ala Cys Leu Asp Val Thr Arg Asp Val Gln Gln Ser 
15 20 25 

TGG TCC ATG TAC TCT CCC TAT TTC CCG GCA GCA ACT TAT GTG GCT CCG 255 
Trp Ser Met Tyr Ser Pro Tyr Phe Pro Ala Ala Thr Tyr Val Ala Pro 
30 35 40 

CCC GCG AGT TGC CAG ATC AAT CAG GTC CAC ATC ATC CAA CGT CAT GGT 303 
Pro Ala Ser Cys Gln Ile Asn Gln Val His Ile Ile Gln Arg His Gly 
45 50 55 

GCA CGC TTT CCC ACG TCT GGC GCA GCA AAG CGC ATC CAG ACA GCA GTA 351 
Ala Arg Phe Pro Thr Ser Gly Ala Ala Lys Arg Ile Gln Thr Ala Val 
60 65 70 

GCG AAG CTG AAG GCC GCG TCC AAC TAC ACC GAT CCC CTG CTC GCG TTC 399 
Ala Lys Leu Lys Ala Ala Ser Asn Tyr Thr Asp Pro Leu Leu Ala Phe 
75 80 85 90 

GTT ACG AAC TAC ACC TAC AGC TTA GGT CAG GAC AGC CTC GTT GAA CTC 447 
Val Thr Asn Tyr Thr Tyr Ser Leu Gly Gln Asp Ser Leu Val Glu Leu 
95 100 105 
GGT GCG ACT CAG TCC TCC GAA GCG GGC CAG GAG GCA TTC ACG CGG TAC 
Gly Ala Thr Gln Ser Ser Glu Ala Gly Gln Glu Ala Phe Thr Arg Tyr 
110 115 120 

TCA TCC CTC GTG AGC GCG GAC GAG CTT CCC TTC GTT CGG GCG TCG GGC 
Ser Ser Leu Val Ser Ala Asp Glu Leu Pro Phe Val Arg Ala Ser Gly 
125 130 135 

TCA GAT CGC GTC GTT GCG ACT GCC AAC AAC TGG ACT GCA GGT TTC GCG 
Ser Asp Arg Val Val Ala Thr Ala Asn Asn Trp Thr Ala Gly Phe Ala 
140 145 150 

CTT GCG AGC TCA AAC AGC ATC ACG CCC GTG CTC TCA GTC ATC ATT TCC 
Leu Ala Ser Ser Asn Ser Ile Thr Pro Val Leu Ser Val Ile Ile Ser 
155 160 165 170 

GAA GCG GGC AAT GAC ACC CTC GAC GAC AAC ATG TGC CCC GCT GCA GGC 
Glu Ala Gly Asn Asp Thr Leu Asp Asp Asn Met Cys Pro Ala Ala Gly 
175 180 185 

GAT TCG GAT CCC CAG GTC AAT CAA TGG CTC GCG CAG TTC GCA CCG CCG 
Asp Ser Asp Pro Gln Val Asn Gln Trp Leu Ala Gln Phe Ala Pro Pro 
190 195 200 

ATG ACT GCT CGC CTC AAC GCA GGC GCG CCC GGC GCG AAC CTC ACG GAC 
Met Thr Ala Arg Leu Asn Ala Gly Ala Pro Gly Ala Asn Leu Thr Asp 
205 210 215 

ACG GAC ACC TAC AAC CTG CTC ACG CTA TGC CCG TTC GAG ACT GTA GCC 
Thr Asp Thr Tyr Asn Leu Leu Thr Leu Cys Pro Phe Glu Thr Val Ala 
220 225 230 

ACC GAG CGG CGT AGT GAA TTC TGC GAC ATC TAC GAG GAG CTG CAG GCG 
Thr Glu Arg Arg Ser Glu Phe Cys Asp Ile Tyr Glu Glu Leu Gln Ala 
235 240 245 250 

GAA GAC GCC TTC GCG TAC AAT GCC GAT CTC GAC AAG TTC TAC GGC ACT 
Glu Asp Ala Phe Ala Tyr Asn Ala Asp Leu Asp Lys Phe Tyr Gly Thr 
255 260 265 

GGA TAC GGC CAG CCC CTC GGA CCC GTG CAA GGC GTC GGG TAC ATC AAC 
Gly Tyr Gly Gln Pro Leu Gly Pro Val Gln Gly Val Gly Tyr Ile Asn 
270 275 280 

GAG CTC ATC GCG CGC CTC ACC GCG CAG AAC GTG TCC GAC CAC ACG CAG 
Glu Leu Ile Ala Arg Leu Thr Ala Gln Asn Val Ser Asp His Thr Gln 
285 290 295 

ACG AAC AGC ACA CTC GAC TCC TCG CCC GAG ACG TTC CCG CTC AAC CGC 
Thr Asn Ser Thr Leu Asp Ser Ser Pro Glu Thr Phe Pro Leu Asn Arg 
300 305 310 

ACG CTC TAC GCG GAC TTC TCG CAC GAC AAC CAG ATG GTC GCG ATC TTC 
Thr Leu Tyr Ala Asp Phe Ser His Asp Asn Gln Met Val Ala Ile Phe 
315 320 325 330 

TCG GCC ATG GGT CTC TTC AAC CAG TCC GCG CCG CTC GAC CCG ACG ACG 
Ser Ala Met Gly Leu Phe Asn Gln Ser Ala Pro Leu Asp Pro Thr Thr 
335 340 345 



CCC GAC CCC GCG CGC ACG TTC CTC GTC AAG AAG ATC GTG CCG TTC TCC 1215 
Pro Asp Pro Ala Arg Thr Phe Leu Val Lys Lys Ile Val Pro Phe Ser 
350 355 360 

GCG CGC ATG GTC GTC GAG CGC CTC GAC TGC GGC GGT GCG CAG AGC GTG 1263 
Ala Arg Met Val Val Glu Arg Leu Asp Cys Gly Gly Ala Gln Ser Val 
365 370 375 

CGC CTG CTC GTG AAC GAC GCA GTG CAG CCG CTG GCG TTC TGC GGG GCG 1311 
Arg Leu Leu Val Asn Asp Ala Val Gln Pro Leu Ala Phe Cys Gly Ala 
380 385 390 

GAC ACG AGC GGG GTG TGC ACG CTG GAC GCG TTT GTC GAG AGC CAG GCG 1359 
Asp Thr Ser Gly Val Cys Thr Leu Asp Ala Phe Val Glu Ser Gln Ala 
395 400 405 410 

TAC GCG CGG AAC GAT GGC GAG GGC GAC TTC GAG AAG TGC TTC GCG ACA 1407 
Tyr Ala Arg Asn Asp Gly Glu Gly Asp Phe Glu Lys Cys Phe Ala Thr 
415 420 425 

TAGTTCCAGG TGTAGATACC CGGGGAAGAT GTACTCTCTA GACACCTCGC ATGTACTTAT 1467 
CGATTAGAAA GAGACCCTGG CTGCTCTGCC CTCAAAAAAA AAAAAAAAAA AAAAAATTCC 1527 
TGCGGCCGC 1536 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
    (A) LENGTH: 1501 base pairs 
    (B) TYPE: nucleic acid 
    (C) STRANDEDNESS: single 
    (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 
    (A) ORGANISM: Agrocybe pediades 
    (B) STRAIN: CBS 900.96 

(ix) FEATURE: 
    (A) NAME/KEY: CDS 
    (B) LOCATION: 17..1375 

(ix) FEATURE: 
    (A) NAME/KEY: sig_peptide 
    (B) LOCATION: 17..94 

(ix) FEATURE: 
    (A) NAME/KEY: mat_peptide 
    (B) LOCATION: 95..1375 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GGATCCGAAT TCACTT ATG TCC CTC TTC ATC GGC GGC TGT TTG CTC GTG 49 
                  Met Ser Leu Phe Ile Gly Gly Cys Leu Leu Val 
                  -26 -25 -20 

TTT TTA CAG GCG AGC GCA TAC GGC GGC GTC GTG CAG GCC ACA TTC GTG 97 
Phe Leu Gln Ala Ser Ala Tyr Gly Gly Val Val Gln Ala Thr Phe Val 
-15 -10 -5 1 

CAG CCG TTT TTC CCT CCA CAG ATT CAG GAC TCT TGG GCA GCT TAT ACA 145 
Gln Pro Phe Phe Pro Pro Gln Ile Gln Asp Ser Trp Ala Ala Tyr Thr 
5 10 15 

CCA TAT TAT CCT GTT CAG GCG TAC ACG CCT CCC CCG AAG GAT TGC AAG 193 
Pro Tyr Tyr Pro Val Gln Ala Tyr Thr Pro Pro Pro Lys Asp Cys Lys 
20 25 30 

ATC ACA CAA GTT AAC ATT ATT CAA CGA CAT GGT GCC CGC TTT CCG ACA 241 
Ile Thr Gln Val Asn Ile Ile Gln Arg His Gly Ala Arg Phe Pro Thr 
35 40 45 

TCG GGG GCA GGC ACA AGG ATC CAA GCA GCT GTG AAG AAG CTT CAA TCA 289 
Ser Gly Ala Gly Thr Arg Ile Gln Ala Ala Val Lys Lys Leu Gln Ser 
50 55 60 65 

GCT AAA ACC TAT ACG GAT CCT CGT CTC GAC TTT CTG ACC AAC TAT ACC 337 
Ala Lys Thr Tyr Thr Asp Pro Arg Leu Asp Phe Leu Thr Asn Tyr Thr 
70 75 80 

TAT ACC CTT GGT CAC GAC GAT CTC GTA CCG TTT GGA GCG CTT CAA TCA 385 
Tyr Thr Leu Gly His Asp Asp Leu Val Pro Phe Gly Ala Leu Gln Ser 
85 90 95 
TCA CAA GCT GGA GAG GAA ACG TTT CAA CGA TAC TCG TTT CTG GTG TCC 
Ser Gln Ala Gly Glu Glu Thr Phe Gln Arg Tyr Ser Phe Leu Val Ser 
100 105 110 

AAA GAG AAC TTA CCT TTT GTA AGA GCT TCG AGT TCfc AAT CGA GTC GTC 
Lys Glu Asn Leu Pro Phe Val Arg Ala Ser Ser Ser Asn Arg Val Val 
115 120 125 

GAC TCA GCT ACC AAC TGG ACG GAA GGT TTT TCT GCG GCC AGT CAC CAC 
Asp Ser Ala Thr Asn Trp Thr Glu Gly Phe Ser Ala Ala Ser His His 
130 135 140 145 

GTC TTG AAT CCC ATT CTC TTT GTA ATC CTC TCA GAA AGT CTC AAT GAC 
Val Leu Asn Pro Ile Leu Phe Val Ile Leu Ser Glu Ser Leu Asn Asp 
150 155 160 

ACG CTT GAC GAT GCC ATG TGC CCT AAC GCG GGC TCC TCC GAC CCG CAG 
Thr Leu Asp Asp Ala Met Cys Pro Asn Ala Gly Ser Ser Asp Pro Gln 
165 170 175 

ACT GGT ATC TGG ACC TCG ATA TAC GGG ACG CCT ATT GCC AAC CGA CTA 
Thr Gly Ile Trp Thr Ser Ile Tyr Gly Thr Pro Ile Ala Asn Arg Leu 
180 185 190 

AAT CAG CAG GCT CCG GGT GCA AAT ATT ACA GCT GCC GAT GTG TCG AAC 
Asn Gln Gln Ala Pro Gly Ala Asn Ile Thr Ala Ala Asp Val Ser Asn 
195 200 205 

CTT ATA CCG CTT TGC GCA TTC GAG ACG ATA GTA AAG GAG ACG CCA AGT 
Leu Ile Pro Leu Cys Ala Phe Glu Thr Ile Val Lys Glu Thr Pro Ser 
210 215 220 225 

CCT TTC TGT AAT TTG TTC ACC CCC GAA GAG TTC GCA CAG TTT GAA TAT 
Pro Phe Cys Asn Leu Phe Thr Pro Glu Glu Phe Ala Gln Phe Glu Tyr 
230 235 240 

TTC GGT GAC CTG GAC AAG TTC TAT GGG ACA GGT TAT GGA CAA CCG TTA 
Phe Gly Asp Leu Asp Lys Phe Tyr Gly Thr Gly Tyr Gly Gln Pro Leu 
245 250 255 

GGA CCT GTG CAA GGT GTC GGC TAC ATC AAT GAA CTT CTT GCC CGA CTC 
Gly Pro Val Gln Gly Val Gly Tyr Ile Asn Glu Leu Leu Ala Arg Leu 
260 265 270 

ACA GAA ATG CCA GTT CGA GAT AAC ACC CAG ACG AAC AGG ACA CTC GAC 
Thr Glu Met Pro Val Arg Asp Asn Thr Gln Thr Asn Arg Thr Leu Asp 
275 280 285 

TCT TCT CCG CTT ACA TTT CCC CTC GAC CGC AGT ATC TAC GCT GAC CTC 
Ser Ser Pro Leu Thr Phe Pro Leu Asp Arg Ser Ile Tyr Ala Asp Leu 
290 295 300 305 

TCG CAC GAT AAC CAA ATG ATC GCG ATA TTT TCA GCG ATG GGT CTT TTC 
Ser His Asp Asn Gln Met Ile Ala Ile Phe Ser Ala Met Gly Leu Phe 
310 315 320 

AAC CAG AGT TCA CCT TTG GAT CCG TCC TTC CCC AAC CCC AAG CGT ACT 
Asn Gln Ser Ser Pro Leu Asp Pro Ser Phe Pro Asn Pro Lys Arg Thr 
325 330 335 
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TGG GTC ACC AGT CGG CTT ACG CCT TTC AGC GCG AGA ATG GTC ACT GAG 
Trp Val Thr Ser Arg Leu Thr Pro Phe Ser Ala Arg Met Val Thr Glu 
340 345 350 

CGG TTG CTG TGT CAA AGG GAT GGG ACA GGG AGC GGT GGA CCA TCC AGG 
Arg Leu Leu Cya Gin Arg Asp Gly Thr Gly Ser Gly Gly Pro Ser Arg 
355 360 365 

ATC ATG CGG AAT GGA AAT GTG CAG ACG TTT GTG AGG ATT CTT GTC AAC 
He Met Arg Asn Gly Aan Val Gin Thr Phe Val Arg " He Leu Val Asn 
370 375 380 385 

GAT GCT TTA CAG CCT TTG AAG TTC TGC GGA GGG GAC ATG GAT AGT TTG 
Asp Ala Leu Gln Pro Leu Lys Phe Cys Gly Gly Asp Met Asp Ser Leu
390 395 400 

TGT ACT CTG GAA GCG TTC GTC GAG AGC CAG AAG TAT GCA CGA GAG GAT 
Cys Thr Leu Glu Ala Phe Val Glu Ser Gln Lys Tyr Ala Arg Glu Asp
405 410 415

GGT CAA GGC GAT TTT GAA AAA TGT TTT GAT TAAATATTGC AGTATGCTCA 
Gly Gln Gly Asp Phe Glu Lys Cys Phe Asp
420 425 

GTGAGTAGAC TACAGTGCAG GCCCTGTAAC TCTTGTATTG TGTTTCTGGA ATTCCTCGGA 
GCGTAGTTTG TAGCAAAAAA AAAAAAAAAA AAATTCCTGC GGCCGC 
(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1593 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Peniophora lycii

(B) STRAIN: CBS 686.96

(ix) FEATURE:

(A) NAME/KEY: sig_peptide

(B) LOCATION: 123..212

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 213..1439

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 123..1439



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



GGATCCGAAT TCCATCTTCT GCTCTGACCT CCATCTCGCT GAGCGGCCGA CGAGAACCTA 60 

GGGGCTCTAA GTCCACGTAC TATCGCCGCG CCTGTGAAGG CCCCATACCA GCCCTTATCG 120 

AT ATG GTT TCT TCG GCA TTC GCA CCT TCC ATC CTA CTT AGC TTG ATG 167
Met Val Ser Ser Ala Phe Ala Pro Ser Ile Leu Leu Ser Leu Met
-30 -25 -20 

TCG AGT CTT GCT TTG AGC ACG CAG TTC AGC TTT GTT GCG GCG CAG CTA 215 
Ser Ser Leu Ala Leu Ser Thr Gln Phe Ser Phe Val Ala Ala Gln Leu
-15 -10 -5 1 

CCT ATC CCC GCA CAA AAC ACA AGT AAT TGG GGG CCT TAC GAT CCC TTC 263 
Pro Ile Pro Ala Gln Asn Thr Ser Asn Trp Gly Pro Tyr Asp Pro Phe
5 10 15 

TTT CCC GTC GAA CCG TAT GCA GCT CCG CCG GAA GGG TGC ACA GTG ACA 311
Phe Pro Val Glu Pro Tyr Ala Ala Pro Pro Glu Gly Cys Thr Val Thr 
20 25 30 

CAG GTC AAC CTG ATT CAG AGG CAC GGC GCG CGT TGG CCC ACA TCC GGC 359 
Gln Val Asn Leu Ile Gln Arg His Gly Ala Arg Trp Pro Thr Ser Gly
35 40 45 

GCG CGG TCG CGG CAG GTC GCC GCC GTA GCG AAG ATA CAA ATG GCG CGA 407 
Ala Arg Ser Arg Gln Val Ala Ala Val Ala Lys Ile Gln Met Ala Arg
50 55 60 65 

CCA TTC ACG GAT CCC AAG TAT GAG TTC CTC AAC GAC TTC GTG TAC AAG 455 
Pro Phe Thr Asp Pro Lys Tyr Glu Phe Leu Asn Asp Phe Val Tyr Lys 
70 75 80
TTC GGC GTC GCC GAT CTG CTA CCG TTC GGG GCT AAC CAA TCG CAC CAA
Phe Gly Val Ala Asp Leu Leu Pro Phe Gly Ala Asn Gln Ser His Gln
85 90 95

ACC GGC ACC GAT ATG TAT ACG CGC TAC AG* ACA CTA TTT GAG GGC GGG
Thr Gly Thr Asp Met Tyr Thr Arg Tyr Ser Thr Leu Phe Glu Gly Gly
100 105 110

GAT GTA CCC TTT GTG CGC GCG GCT GGT GAC CAA CGC GTC GTT GAC TCC 
Asp Val Pro Phe Val Arg Ala Ala Gly Asp Gln Arg Val Val Asp Ser
115 120 125 

TCG ACG AAC TGG ACG GCA GGC TTT GGC GAT GCT TCT GGC GAG ACT GTT 
Ser Thr Asn Trp Thr Ala Gly Phe Gly Asp Ala Ser Gly Glu Thr Val
130 135 140 145

CTC CCG ACG CTC CAG GTT GTG CTT CAA GAA GAG GGG AAC TGC ACG CTC 
Leu Pro Thr Leu Gln Val Val Leu Gln Glu Glu Gly Asn Cys Thr Leu
150 155 160 

TGC AAT AAT ATG TGC CCG AAT GAA GTG GAT GGT GAC GAA TCC ACA ACG 
Cys Asn Asn Met Cys Pro Asn Glu Val Asp Gly Asp Glu Ser Thr Thr 
165 170 175

TGG CTG GGG GTC TTT GCG CCG AAC ATC ACC GCG CGA TTG AAC GCT GCT 
Trp Leu Gly Val Phe Ala Pro Asn Ile Thr Ala Arg Leu Asn Ala Ala
180 185 190 

GCG CCG AGT GCC AAC CTC TCA GAC AGC GAC GCG CTC ACT CTC ATG GAT 
Ala Pro Ser Ala Asn Leu Ser Asp Ser Asp Ala Leu Thr Leu Met Asp 
195 200 205 

ATG TGC CCG TTC GAC ACT CTC AGC TCC GGG AAC GCC AGC CCC TTC TGT 
Met Cys Pro Phe Asp Thr Leu Ser Ser Gly Asn Ala Ser Pro Phe Cys 
210 215 220 225 

GAC CTA TTT ACC GCG GAG GAG TAT GTG TCG TAC GAG TAC TAC TAT GAC 
Asp Leu Phe Thr Ala Glu Glu Tyr Val Ser Tyr Glu Tyr Tyr Tyr Asp 
230 235 240 



CTC GAC AAG TAC TAT GGC ACG GGC CCC GGG AAC GCT CTC GGT CCT GTC
Leu Asp Lys Tyr Tyr Gly Thr Gly Pro Gly Asn Ala Leu Gly Pro Val
245 250 255



CAG GGC GTC GGA TAC GTC AAT GAG CTG CTT GCA CGC TTG ACC GGC CAA 
Gln Gly Val Gly Tyr Val Asn Glu Leu Leu Ala Arg Leu Thr Gly Gln
260 265 270 

GCC GTT CGA GAC GAG ACG CAG ACG AAC CGC ACG CTC GAC AGC GAC CCT 
Ala Val Arg Asp Glu Thr Gln Thr Asn Arg Thr Leu Asp Ser Asp Pro
275 280 285 

GCA ACA TTC CCG CTG AAC CGT ACG TTC TAC GCC GAC TTC TCG CAT GAT 
Ala Thr Phe Pro Leu Asn Arg Thr Phe Tyr Ala Asp Phe Ser His Asp 
290 295 300 305 

AAC ACC ATG GTG CCC ATC TTT GCG GCG CTC GGG CTC TTC AAC GCC ACC 
Asn Thr Met Val Pro Ile Phe Ala Ala Leu Gly Leu Phe Asn Ala Thr
310 315 320 
GCC CTC GAC CCG CTG AAG CCC GAC GAG AAC AGG TTG TGG GTG GAC TCT 1223
Ala Leu Asp Pro Leu Lys Pro Asp Glu Asn Arg Leu Trp Val Asp Ser
325 330 335

AAG CTG GTA CCG TTC TCT GGA CAT ATG ACG GTC GAG AAG CTG GCA TGT 1271 
Lys Leu Val Pro Phe Ser Gly His Met Thr Val Glu Lys Leu Ala Cys 
340 345 350 

TCT GGG AAG GAG GCG GTC AGG GTG CTC GTG AAC GAC GCG GTG CAG CCG 1319 
Ser Gly Lys Glu Ala Val Arg Val Leu Val Asn Asp Ala Val Gln Pro
355 360 365

CTG GAG TTC TGC GGA GGT GTT GAT GGG GTG TGC GAG CTT TCG GCT TTC 1367 
Leu Glu Phe Cys Gly Gly Val Asp Gly Val Cys Glu Leu Ser Ala Phe 
370 375 380 385

GTA GAG AGC CAG ACG TAT GCG CGG GAG AAT GGG CAA GGC GAC TTC GCC 1415
Val Glu Ser Gln Thr Tyr Ala Arg Glu Asn Gly Gln Gly Asp Phe Ala
390 395 400

AAG TGC GGC TTT GTT CCG TCG GAA TAGCGGGAGA CCGTCTATGC TACACAGTAA 1469 
Lys Cys Gly Phe Val Pro Ser Glu
405 

TTGTGTACTC TATAGCACTG TAGCTGTACT TACAAGTCGT AGGGTACGAT CGTACTTACG 1529 

CTCGTTTATT GATCCTTCCT TTAAAAAAAA AAAAAAAAAA AAAAAAAAAA ATTCCTGCGG 1589 

CCGC 1593 
ca-aggagacgvgcTCccsctcgggcgcggctgcggffZSxgggcgrtg'cc.TcggacgffEga 120
79 a 9ST7gsracgggc^gggcgrzgatgacggracgaacgcgaacggacacaggccgccgag 180 
cgrgggrgttgcgrzccaaccztcccccgrgxgggxgxgcacg^gtgggrtgrcgtacgxgc 240 
ttgggggggggaazgtitcrtgjtaattatctttctaccctecttctccttcctttattct: 300 
gttcagcaggtatacceegcgcaagcgcacaggatcatgggacgggtgggcggatggaec 3 SO 
aertctiagaaggacggacaaggaaaaaggggaaacacgaatatggcgccccggsrbggcgc 420 
gtcgagctggatgcttgacgccggtctggcaaaeateetcttettetagcacccaaccta 4ao 
gt act t gat agagtgttteggggccaggcggtttgcgctgtgtttetaccaat caeca ac 540 
bagtgctactactattattgcggctgrtgatgcagccgxgtaccaaaaa tgcegcggcac 600 
ctceattgatacttgtagttttgatagatcaacatctgggaggtcgcgctgggctgetct 6 60 
gaaacccctetctcttgccgtacgtaacgtafcgtgcaeagtatgteaccgacaaagacga 720 
t tgcacgcgcatcgttttttgxtgt gtttcaggcctcgctcgt gt ctagggtataaacac 780 
a ttgaagactacatatgcgcaagacgttgacatcaacggggncctgcagccgccgcaggt 8 40 
gcatgtcgtgattaataecacgcgcctgcgtaaatcagctagccgccgccctgtttcaet 9QQ 
cggttagagacggacaggtgagaegggtctcggttaagcaagcaaactggaatgcaaggt 9 60 
tgaaggtgtaatctgcatagcgtggaaatgagagggct ct gcgggcagccaggaaggtga 1020 
gacgaaacgaggaxagaggeaccagaagctgtcgtcctgaagtgcccgcggxcatagctc 1080 
caggattaagtacggatgtcccatgecaagctgctggcttcgaaagcgagtacggagtag 1140 
tgtccattgttcacgagggatccccaa tgtgttagacatgcctgaatcaatttcgcccta 120 0 
tttttggatttcaactgtttct cccgactgxgctcggtagcgactatgccgcaaggtaea 12 60 
ctaeatgttgtacaataazcatacaccgacettccgtaggagtgctgaaataeccgacct 1320 
gctcxctctagcaggtgcctaatggctitcgtgtaactcgatcgaaacggateagcaagt 1380 
ccatttgctgttggrtgagatgtacgattfcacaaacacgtggagaggtgagccacagcga 1440 
taggcttctggaaggattctggcgtctcggaaagagggccactcgccccactaaccggcg 1500 
ccgatcttgacatggggctcgcagggggtttaagtgcacactacggagtacggattacac 15 60 
agrtay LyLatgggtgggggcgagtttgggtggccttgtgtggggctcaccggctgcctgt 1520 
tctcggggagtcttggcgggccgattggacccacctaaccacgggtagtcttggcccggc 1580 
caactcacacegccctcacgtttcggagceagteaggsagyesggeactactcagteagg 1740 
tacacacgtcgggctoctcgatgctgggtgacatcgaggcgatactgcattccaactacg 1800 
gttggcataggaggtatcctattctagagctgttctacgccggaacgtaacecgggataa I860 
ce e g g g a tatcgcttcectgagegagcgcgctgctgaggateatacaacceaacaaccga 1920 
cgacggtgcaagaaggttgggggaaggaagaaatcaaggaaaaaaaaatagggggggtgg 1980 
ggaccaag-agagaaagaaaggagaaaagggrggggggagggaagagaaaaaaaaaaegga 20 40 
ggaatatggcgtegctcttcgactggttccggaagggggcatctgggtacacatatgeac 2100 
ctcttccgcacggcagggatataaaccgggagtgcagtcccaccgatcacgctgagtccg 2160 
cccgtetccagacztcacggtcgcagaggactagacgcgcggtgaagatgactggcctcg 2220 

M X G L G 5 

gagtgatggtggrgatggtcggcttcecggcgatcgcctctetgtaagcagcgatcccag 2280 
VMVVMVGr L A I A 5 L. 19 

gggtccggtgtgcgttaaaagaaaaagctaacgccaccagacaatccgagtcccggccat 2340 

Q2ESKFC 26 

gcgacaccccagacttgggcttccagtgtggtacggccatttcccacttctggggccagt 2400 
DTPDXGFQCGTAISHFWGQr 46 

actcgccctacttctccgtgeceteggagctggatgcttcgatccccgacgactgcgagg 24 60 

sprrs v?s eldasxpddcev ss 

tgacgtttgcccaagtcctctcccgcsacggcgcgagggcgccgacgcccaaacgggccg 2520 
TrAQV2.SRHGARAPTl.KRAA 86 

cgagctacgtcgacctcatcgacaggacccaccacggcgccatctcctacgggccgggct 2580 
S Y V D t I D R I K HGA I S X Q P G < * 106 

acgagttccteagcacgtacgactacaccctgggcgcegacgagcteacccggacgggcc 2640 
EFLRTXDYTLGADELTRTGQ 126 



agcagcagatgg^caactcgggcatcaagttttaccgcegceaccgcgctctcgcccgca 2700 
QQKVHSGZ XTYRR^RALARK 146 
as^cgatcccciifccgTccgcaccgccggccaggaccgcgtcgtccactcggccgagaac!: 27 £0
SIPFVRTAGQ D RVVHSAENF 166 

tcacccagggcttccactcrgccctgctcgccgaccgcgggtccaccgxccggcccaccc 2820 
T Q G F H S A L LA D R G . S X V R ? T L 18 6 

tcccctatgaeafcggrcgtcatcecggaaaccgeeggcgccaacaacacgctccacaaeg 2880 
P*D.MVVIP ETAGAKNTLHWD 206 

acctctgcaccgccttcgaggaaggcccgxactcgaccatcggcgacgacgcccaagaca 2340 
&CTAFZEG? *J£ 5 TXGODAQOT 226 

cctaectctccaectzcgccggacecatcaccgcccgggxcaacgccaacctgccgggcg 3000 
*I*S?FAGP I TA R VNANLPGA 246 

ccaacctgaccgac^ccgacacggrcgcgctgatiggacctctgccccttcgagacgglicg 30 60 
N L T D A D T V A I* * M D I» C P FETVA 266 

cctcctcctcctccgacecggcaacggcggacgcggggggcggcaacgggcggccgctg^ 3120 
SSSSDPATADAGGGNGRPLS 286 

cgcccttctgccgcctgtwcagcgagtccgagrcggcgcgcgtacgactacctgcagccgg 318 0 
PFCRLFS E S EWRAXD*I*QSV30 6 

tgggcaagtggtacgggcacgggccgggcaacccgctggggccgacgcagggggtcggg^ 3240 
5 K W * G X G'P G N P L G P T.Q G V G .F 326 

tcgtcaacgagctgctggcgcggctggccggggxccccgtgcgcgacggcaccagcacca 3300 
V. KELLARLAGVP VRDGTSTN 346 

accgcacccfccgacggcgacccgcgcaccttcccgctcggccggcccctctacgccgact 3360 
RTLDGDPRTFP LGRPLYADT 366 

tcagccacgacaacgacafcgafcgggcgtectcggcgccctcggcgcctacgaeggcgtcc 3 420 
SHDNOHWGVLGALGAYDGVP 386 

cgcccct cgacaagaccgcccgccgcgacccggaagagc tcggcggg t acgcggccagct 3 4BO 
PI.DXTARRD P EELGGrAASH40 6 

ffsgecgtcccgttcgccgccaggaectacgtcgagaagatgcggtigcagcggcggcggcg 3540 
AVPFAARI Y V E KKRCS GGGG 426 

gcggcggcggcggcggcgaggggcggcaggagaaggatgaggagatggrcagggtgctgg 3 600 
GGGGGEGRQE KDEEHVRVLV446 

tgaacgaccgggcgatgacgctgaaggggtgcggcgccgacgagagggggacgcgtacgc 3 660 
KDRVMTLKG C GADERGMCTL 466 

tagaacggrtcaccgaaagcaeggcgtttgcgagggggaacggcaagtgggatctctgct 3720 
ERFIESMAFARGKGKWDt*CF 486 

ttgcttgatatgcccacgecegagattgaacagaacttgtgatgggggtagagcgtggta 3780 



tccgagatgacagcccacagttttcgggaatcaaaaatcggttagaccggcgaaacccaa 38 40 

gtctggggcctgcggcgcctgcatcccccgccccctg^cgtcacctccttaatggtcctt 3900 

tcccattttticatctccct:tiaaatc.ttcacacaaacct:cctatcgtccctcccccctcct 39 60 

fctccctcceccgcacatcggatgggaattgtcgac 3395 
i AG^TTCAACGACGSAGG^^ 50

M V T 3 

S i TCIGACl u kCCTGC i u iCGGCGGCGTATCTGC L'i TCTGGgcgagwggccc 100 

LTFLLSAAYtLSG IS 

■ • • ♦ « 

10 1 ggacccaccgcccggacacggccgcggcgCwgacccwgaaacggagTnGA ISO 

R 17 

• « ■ • • ■ 

151 CTGTCrCCGGCACCTAGTTCT^ 200 

VSAAPSSAGSKSCDT-VD .34 

• • • • * 

20 1 CCTCGGGTACC^GTGCrCCCCTGCGACTTCTCATCTATGGGGCCAGrACr 2 S 0 

LGYQCS.PATS KLWGQYS SI 

• • * • * 

251 CGCCATTCTTTTCGCTCGAGGACGAGCTGXCCGTGTCGAGTArtGCITCCC 3 0Q 

P F F 5 LED E LSVSSKLP- 67 

• • • • • 

301 AAGGATTGCCGGATCACCTTGGTACAGGTGCrATCGCGCCATGGAGCGCG 3 S 0 

KDCRITLVQVLSRKGAR B4 

• • ■ • • 

3 3 L GTACCCAACCAGCCCC^AGAGCAAAAWTT 4 0 0 

YPTSSKSKKYKKLVTAX 101 

• • • • • 

401 TCCAGGCCAATCCCACCGACTICAAGGGCAAGTTTGCLI u i k i'GAAGACG 4 50 

QAMATDFKGKFAFLKT 117 

• • « • 

451 TAC^ACTATACTCZGGGTGCGGATGACCTCACrCCLl 1 lGGGGAGCAGCA S 0 0 

YNYTLGAO D LTP'cGEQQ 134 
+- 

501 GCTGSraAacrcGGCCATCArcTT^ 530 

LVNSG ISCcYQRYKAIjAR LSI 

Sal GC\G7G7GGTGCCCj i tTAITCGCGCCTC\C^CTCGGACCCKX:TTATTGCT S 0 0 

SVV9FIRASGSDRVIA 1S7 
• • • > *

fi 0 1 TCCKKAGAGAACnTCAT^ 550 

S G E K F I £ G FQQAKLA'D P 134 

• • ■ • • 

551 TGGCGOjACGAACCGCGCCwCTCCGGCSATT^^ 7 q q 

GATNR AAPAISVIIPES 202. 

• • • • • 

7 Q I GCGAGACGTTC^CAATACGCTGGACC^CGGTGTGTGCACCA ^ I l ill AG 7 SO 

ETFWNTLD KGVCTKFE 217 

• • • • • 

751 GCGAGTC^CIT^MATX^ 8 00 

A S Q LG.D E V A A N" F T A L F A 234 



8 Q 1 ACCCGACATCCGAGCTCGCGCCGAGAAGCATCrTCCTGGCGTGACGCTGA 8 50 
P D I RA R i A E KK L ? GV T L T 251 



851 CACACCACGACG l 1 0 i. C ACTCTAATGGAC A l G l' G I TCGTTTG ATACGGTA 900 

DEDVVSLMDMCSFDTV 257 

• • • • • 

901 GCGCGCACC^GCGAcrcrj^f r rrArc-rTrrrAC ci., t ti iu i L AArrnrcAC 9S0 

ARTSDASQLSPFCQLFT 284 



• • • • • 

951 TCACAATGAGTGGAAG AAGT ^ CAACTAGCT T CALi i LL i l ItGGPAAGTACT 1000 

X'NSWKKY'NYLQSLGKYY 301 

• • • « * 

1001 ACGGCTACGGCGCAGGCAACCCTCTCGGACCGGCrCAGGCGATAGGGT^ 1050 

GYGAGN P L GPA QGIGF 317 

• • • • 

1051 ACCAACGAGCTGAT7CCCCGGTTGACTCGTTCGCCAGTGCAGGACCACAC 1100 

T N E L . I *A RLT RS PVQ DKT 334 . 

• • • • • 

1101 C^CACTA^CTCGACTCTAGTCTCCAACCCGGCCACCTTCCCGTTGAACG 1150 

STfrTSTLVS NPATFPLNA 351 



• « • • • 

1151 CTACCATGTACGTCGA L l l ^V C^C^CGAC^C^^TGGTrrCCATCrrC 1200 
TMYVDFSKDN5MVS IF 337 



1201 TTrGCVTtCCOCCTGTACVXCC<jC^CTGAACCCTTGT 1250 
FALGLVMGTSPLSRTSV 334 
• • • • •

12 S 1 GGAAAGCGCCAAGGAATTCGATGGCTATT^ 1300 
ES AKELDGYSA SWVV?F 401 



• ■ • • * 

1301 TCGGCGCGCGAGCCTACTTCCA« 13 SO 

G A R A Y c ETMQCX'SEKE 417 



13 S 1 CCT L.I1.U 1 J. CGCG C 1 1 1U ATTAATGACCGG G 1 i.GCCACTGCATGGCTG 1400 
PLVRALIMD RVVPLKGC 434 



14 0 1 CaVIGXGGACAAGCrCSGGCGATGCAM 14 50 

DVDKLGRCKLNDFVKGL 4S1 



• • • • • 

1451 TOPJGTrGGGCCAJ^TCTGGGGGCAACTG^ 1500 
SW.ARSGGNWGECFS 465 



1501 
1S51 



issa 

1571 
1 50

A. terreus 9A-1 MGFLalVLS. VaLLfrsTSG TrLGprg K hsCCMSVDhG YQCFPELSHk
A. terreus cbs MGVFv VLL5 . iacLfgsTSG TALGprg. .N hsDCTSVDrG YQCFPELSHk
A. niger var. awamori MGVsaVLL?. lYLLagVTSG LAV?asr..M qsTCOTVDQG YCCF5ETSKL
A. niger T213 MGVsaVLL?. lYLLagVTSG iAVPasr. .N qsTCDTVCQG YQCFSSTSHL
A. niger NRRL3135 MGVsaVLL?. JLYLLsqVTSG lAVPasr. .N gsSCDTVDQG YQCFSETSHL
A. fumigatus 13073 MVtLcFLLSa AYLLsgVSAA. PSsA G SkSCDTVDIG YQCsPATSHL
A. fumigatus 32722 MVtLtFLLSa AYLLsgVSAA PSsA G SkSCDTVDIG YQCsPATSHL
A. fumigatus 58128 MVtLtFLLSa AYLLsgVSAA PSsA G SkSCDTVDIG YQCsPATSHL
A. fumigatus 26906 MVtLtFLLSa AYLLsgVSAA PSsA G SkSCDTVDIG YQCsPATSHL
A. fumigatus 32239 MGaLtFILSV mYLLsgVAGA PSsGcsagsG SfcACDTVSlG. YQCsPGTSHL
A. nidulans MAFFtVaLSL yYLLsrVSAQ APW. . . . .Q MKSCNTADGG YQCFPNVSHV
T. thermophilus MSLLlLVLSg GLValyVSrN PKV D SHSCNTVEGG YQCrFSISHs
M. thermophila MVgFlAJaSL esE SRPCOTpOlG FQCgTAISHF
Consensus MGFL-VLLSL GYLL--VSAG ??VG N SKSCDTVDGG YQCFPEISHL
Conphys HGVfWIXS. IATLFGSTAG YALGPRG. .N SKSCDTVDGG YQCFPEISHL



51 100

A. terreus 9A-1 WGIYAPYFSL QDESPFPIOV PEDCHITFVQ VLARHGARsP ThSKtKAYAA
A. terreus cbs WGIYAPYFSL Q0ESPFP1DV PODCHITFVQ VLARHGARsP TDSKtKAYAA
A. niger var. awamori WGQYAPFFSL ANESAISPDV PAGCRVTFAQ VLSRHGARYP TESKgKkYSA
A. niger T213 WGQYAPFFSL ANESVISPDV PAGCRVTFAQ VLSRHGARYP TESKgKkYSA
A. niger NRRL3135 WGQYAPFFSL ANESVISPEV PAGCRVTFAQ VLSRHGARYP TDSKcKkYSA
A. fumigatus 13073 WGQYSPFFSL EDS1SVS5KL PKDCRITLVQ VLSRHGARYP TSSKsKkYKk
A. fumigatus 32722 WGQYSPFFSL EDE1SVS3KL PKDCRITLVQ VLSRHGARYP TSSKsKkYKk
A. fumigatus 58128 WGQYSPFFSL EDE1SVSSKL PKDCRXTLVQ VLSRHGARYP TSSKsKkYKk
A. fumigatus 26906 WGQYSPFFSL EOEISVSSKL PKDCRITLVQ VLSRKGARYP TSSKsKkYKk
A. fumigatus 32239 WGQYSPFFSL EDEISVSSDL PKDCRVTFVQ VLSRHGARYP TASKSKkYKk
A. nidulans WGQYSPYFSI EQESAISeDV PHGCEVTFVQ VLSRHGARYP TESKsKAYSG
T. thermophilus WGQYSPFFSL ADQSEZSPOV FQNCKITFVQ LLSRKGARYP TSSKtElYSQ
M. thermophila WGQYSPYFSV pSSlDaS..1 PDDCEVTFAQ VLSRHGARaP TLKRaaSYvD
Consensus WGQYSPYFSL EDESAISPDV PDDCRVTFVQ VLSRHGARYP TSSX-KAYSA
Conphys WGQYSPYFSL EDESAISPDV PDDCRVTFVQ VLSRSGA3YP TSSKSKAYSA



101 150

A. terreus 9A-1 tIAAIQKSAT aFpGKYAFLQ SYNYSLOSEE LTPFGrMQLr DIGaQFYeRY
A. terreus cbs tlAAIQKMAT aLDGKYAFLK SYNYSMGSEN LTPFGrNQLc DIGaQFYRRY
A. niger var. awamori LIESIQQNVT tFDGKYAFLK TYNYSLGADD LTPFGEQELV NSGIKrYQRY
A. niger T213 LIEEIQQNVT tFDGKYAFLX TYNYSLGADD LTPFGEQELV NSGIKFYQRY
A. niger NRRL3135 LIEEIQQMAT tFDGKYAFLK TYNYSLGADD LTPFGEQELV NSGIKFYQRY
A. fumigatus 13073 LVTAIQaNAT dFKGKFAFLK TYNYTLGAD0 LTPFGEQQLV NSGIKFYQRY
A. fumigatus 32722 LVTAIQaNAT dFKGKFAFLK TYNYTLGADD LTPFGEQQLV NSGIKFYQRY
A. fumigatus 58128 LVTAIQaNAT dFKGKFAFLK TYNYTLGADO LTPFGEQQLV NSGIKFYQRY
A. fumigatus 26906 LVTAIQaNAT dFKGKFAFLK TYNYTLGADD LTAFGEQQLV NSGIKFYQRY
A. fumigatus 32239 LVTAIQKMAT eFKGKFAFLS TYNYTLGADD LTPFGEQQMV NSGIKFYQKY
A. nidulans LIEAIQKNAT sFwGQYAFLE SYNYTLGADO LTiFGENQMV DSGaKFYRRY
T. thermophilus LISrIQKTAT aYKGyYAFLK DYrYqLGAHD LTPFGENQMI QLGIKFYnHY
M. thermophila LIOrlKhGAI sYgPgYEFLR TYDYTLGAOE LTRtGQQQMV NSGIKFYRRY
Consensus LIIAIQ-OIAT -FKGKYAFLK TYNYTLGADD LTPFGENQMV NSGIKFYRRY
Conphys LIEAIQKNAT AFKGKYAFLX TYNYTLGADD LTPFGZKQMV NSGIKFYRRY
151 200

A. terreus 9A-1 NALTRhlePF VRATDASRVh ESAEXFVSGF QTARqODKhA nPHQPSPrVd
A. terreus cbs DTLTahlnPF VRAAdSSRVh SSAEXrVEGF QMARqGOPnA nPHQPSPrVd
A. niger var. awamori ESLTRNIIPF irssgssrvi ASGEKFIEGF QSTKLkDPrA qPgQSSPkld
A. niger T213 E3LTRNIIPF irssgssrvi ASGEKFIEGF QSTKLkDPrA qPgQSSPkld
A. niger NRRL3135 ESLTRMIVpF irssgssrvi ASGKKFIEGF QSTKLkDPrA qPgQSSPkld
A. fumigatus 13073 KALARSVVpF IRASGSDRVI ASGEKFIEGF QqAKLADPGA .TNRAAPAIs
A. fumigatus 32722 KALARSVVPF IRASGSDRVI ASGEKFIEGF QqAKLADPGA .TNRAAPAIs
A. fumigatus 58128 KALARSVVPF IRASGSDRVI ASGEKFIEGF QqAKLADPGA .TNRAAFAIs
A. fumigatus 26906 KALARSVVPF IRASGSDRVI ASGEKFIEGF QqAKLADPGA .TNRAAPAIs
A. fumigatus 32239 XALAcSVVPF rRSSGSDRVI ASGEKFIEGF QqANVAOPGA .TNRAAPVIs
A. nidulans KNLARKnTPF IRASGSORVV ASAEKFIMGF RKAQLhDKGS ..gQATPVVn
T. thermophilus KSLARNaVpF VRCSGSDRVI ASGrlFIEGF QSAKVlDPhS dKHDAPPTIn
M. thermophila RALARKsIPF VRTAGqDRVV hSAENFTQGF KSA1LAORGS tVRPTLPydni
Consensus KALARKIVPF IRASGSDRVI ASAEKFIEGF QSAKLADPGS -PHQASPVI-
Conphys KALARKIVPF IRASGSDKVX ASAEKFIEGF QSAKLADPGS QPHQASFVTD



201 250

A. terreus 9A-1 ValPEGSAYN NTLEKS1CTA FES...StVG ODAvANFTAV FAPAIaQRLE
A. terreus cbs VVIPSGTATO NTLERSICTA FEA...StVG DAAaDNFTAV FAPAIakRLE
A. niger var. awamori VVISEASSsN NTLDPGTCTV FED...SELA DTVEANFTAT FAPSIRQRLE
A. niger T213 VVISEASSsN NTLDPGTCTV FED...SELA DTVEANFTAT FAPSIRQRLE
A. niger NRRL3135 VVISEASSsN NTLDPGTCTV FED...SELA DTVEANFTAT FVPSIRQRLE
A. fumigatus 13073 VIIPESETFN NTLDKGVCTk FEA...SQLG OEVaANFTAl FAPDIRARaE
A. fumigatus 32722 VIIPESETFN NTLDKGVCTk FEA...SQLG DEVaANFTAl FAPDIHARaE
A. fumigatus 58128 VIIPESETFN NTLDKGVCTk FEA...SQLG OEVaANFTAl FAPDIRARaE
A. fumigatus 26906 VIIPESETFN NTLDHGVCTk FEA...SQLG DEVaANFTAl FAPDIRARaK
A. fumigatus 32239 VIIPESETYN NTLDHSVCTN FEA...SSLG DEVEANFTAl FAPAIRARIE
A. nidulans VIIPEiDGFN NTLDKSTCVS FEN...DErA DEiEANFTAI MGPPIRkRLE
T. thermophilus VIIeEGPSYN NTLDtGSCPV FED...SSgG HDAQEKFAkq FAPAI1EKIK
M. thermophila VVIPETAGaN NTLHNDICTA FEEgpyStIG DDAQDTY1ST FAGPItARVN
Consensus VIIPEGSGYN NTLDHGTCTA FED...SELG DOAEANFTAT FAPAIRARLE
Conphys VII5SGSC35H NTLDHGTCTA FED...SELG DDVEANFTAL FAPAIRARLE



251 300

A. terreus 9A-1 ADLPGVqLST OOVVnLMAMC PFETVS1TD OAhTLSPFCD
A. terreus cbs ADLPGVqLSA DOVVnLMAMC PFETVS1TD DAhTLSPFCD
A. niger var. awamori NDLSGVTLTD TEVTyLMDMC SFDTIStST vDTKLSPFCD
A. niger T213 NDLSGVTLTD TEVTyLMDMC SFDTIStST vDTKLSPFCD
A. niger NRRL3135 NDLSGVTLTO TEVTyLMDMC SFDTIStST vDTKLSPFCD
A. fumigatus 13073 kHLPGVTLTD EDVVsLMDMC SFDTVARTS DASQLSPFCQ
A. fumigatus 32722 kHLPGVTLTD EDVVsLMDMC SFDTVARTS DASQLSPFCQ
A. fumigatus 58128 kHLPGVTLTD EDVVsLMDMC SFDTVARTS DASQ ?f?~2
A. fumigatus 26906 kHLPGVTLTD EDVVsLMDMC SFDTVARTS DAS 2rfn^?
A. fumigatus 32239 kHLFGVqLTO DDVVsLMDMC SFDTVARTA OASSLSPFCA
A. nidulans NDLPGIKLTN ENVIyLMDMC SFDTMARTA HG ^ E "
T. thermophilus DHLPGVDLAv SDVpyLMDLC PFETLARNh
M. thermophila ANLPGANLTD ADTVaLMDLC PFETVAsSSs dpatadaggg NGrpLSPFCr
Consensus ADLPGVTI/TO EDVV-LMDMC PFETVARTS DATELSPFCA
Conphys ADLPGVTE.XD EDVVSLMDMC PFETVARTS DATELSPFCA
301 350

A. terreus 9A-1 LFTasEWtQY MYLISLDKYY GYGGGMPLGP VQGVGWaMEL MARLTPAPVH
A. terreus cbs LFTaaEWtQY NYL-1SLDKYY GYGGGNPLGP VQGVGWaMEL IARLTRSPVH
A. niger var. awamori LFTKdEWiKY OYLQSLkKYY GKGAGNPLGP TQGVGYaNEL IARLTKSPVH
A. niger T213 LFTHdSWiHY OYLRSLkKYY GKGAGNPLGP TQGVGYaNEL IARLTKSPVH
A. niger NRRL3135 LTTHdZWiMY OYLQSLkKYY GHGAGMPLGP TQGVGYaNEL IARLTKSPVH
A. fumigatus 13073 LFTKnEWkKY NYLQSLGKYY GYGAGNPLGP AQGIGFtMEL IARLTRSPVQ
A. fumigatus 32722 LFTHnEWkKY NYLQSLGKYY GYGAGNPLGP AQGIGFtNEL IARLTRSPVQ
A. fumigatus 58128 LFTKnEWkKY NYLQSLGKYY GYGAGNPLGP AQGIGFtNEL IARLTRSPVQ
A. fumigatus 26906 LFTHnEWkKY NYLQSLGKYY GYGAGNPLGP AQGIGFtMEL LARLTRSPVQ
A. fumigatus 32239 IFTHnEWkKY DYLQSLGKYY GYGAGNPLGP AQGIGFtNEL lARLTnSPVQ
A. nidulans IFTEkEWlQY DYLQSL5KYY GYGAGSPLGP AQGIGFtNEL IARLTQSPVQ
T. thermophilus LsTQeEWqaY OYYQSLGKYY GaGGGMPLGP AQGVGFvNEL IARMTHSPVQ
M. thermophila LFSEsEWraY DYLQSVGKWY GYGPGNPLGP TQGVGFvNEL LARLAgvPVR
Consensus LFTH-EW-OY DYLQSLGKYY GYGAGNPLGP AQGVGF-MEL IARLTRSPVQ
Conphys LFTHDEWRQY DYLQSLGKYY GYGAGNPLGP AQGVGFANEL IARLTRSPVQ



351 400

A. terreus 9A-1 DHTCVNNTLD ASPATFPLNA TLYADFSHDS NLVSIFWALG LYNGTAPLSq
A. terreus cbs DKTCVNNTLO ANPATFPLNA TLYADFSHDS NLVSIFWALG LYNGTkPLSc
A. niger var. awamori OOTSSNHTLD SNPATFPLNS TLYADFSHDN GIISILFALG LYNGTkPLST
A. niger T213 DDTSSNHTLD SNPATFPLNS TLYADFSHDN GIISILFALG LYNGTkPLST
A. niger NRRL3135 DDTSSNHTLD SSPATFPLNS TLYADFSHDN GIISILFALG LYNGTkPLST
A. fumigatus 13073 DHTSTNsTLv SNPATFPLNA TMYVDFSHDN SMVSIFFALG LYNGTEPLSr
A. fumigatus 32722 DHTSTNsTLv SNPATFPLNA TMYVDFSHDN SMVSIFFALG LYNGTGPLSr
A. fumigatus 58128 DHTSTNsTLv SNPATFPLNA TMYVDFSHDN SMVSIFcALG LYNGTEPLSr
A. fumigatus 26906 DHTSTNsTLv SNPATFPLNA TMYVDFSHDN SMVSIFFALG LYNGTEPLSr
A. fumigatus 32239 DKTSTNsTLD SDPATFPLNA TIYVDFSHDN GMIPIFFAMG LYMGTEPLSa
A. nidulans ONTSTNHTLD SNPATFPLDr KLYAOFSHDN SMISIFFAMG LYNGTQPLSm
T. thermophilus DYTTVNHTLD SNPATFPLNA TLYADFSHDN TMTSIFaALG LYNGTAkLST
M. thermophila DgTSTNRTLD GDPrTFPLGr PLYADFSHDN DMMGVLgALG aYDGVPPLDK
Consensus DHTSTNHTLD SNPATFPLNA TLYADFSHDN SMISIFFALG LYNGTAPLST
Conphys DHTSTNHTLD SNBAXFPLNA TLYADFSHDN SMISIFFALG LYNGTAPLST



401 450

A. terreus 9A-1 TSVESVSQTD GYAAAWTVPF AARAYVEMMQ C RAEKEP
A. terreus cbs TTVEDITrTD GYAAAWTVPF AARAYIEMMQ C RAEKQP
A. niger var. awamori TTVENITQTD GFSSAWTVPF ASR1YVEMMQ C QAEQEP
A. niger T213 TTVENITQTD GFSSAWTVPF ASR1YVEMMQ C QAEQE P
A. niger NRRL3135 TTVENITQTD GFSSAWTVPF ASR1YVEMMQ C QAEQEP
A. fumigatus 13073 TSVESaKElD GYSASWVVPF GARAYFEtMQ C ECS EKE P
A. fumigatus 32722 TSVESaKElD GYSASWVVPF GARAYFEtMQ C ...KSEKEP
A. fumigatus 58128 TSVESaKElO GYSASWVVPF GARAYFEtMQ C ...KSEKES
A. fumigatus 26906 TSVESaKElD GYSASWVVPF GARAYFEtMQ C KSEBCEP
A. fumigatus 32239 TSeSSTKESN GYSASWAVPF GARAYFEtMQ C KSEKE P
A. nidulans DSVESIQEmD GYAASWTVPF GARAYFELMQ C E.KKEP
T. thermophilus TEIKSIEETO GYSAAWTVPF GGRAYIEMMQ C DDSDEP
M. thermophila TArrDpEEIG GYAASWAVPF AARiYVEKMR Csgggggggg gegrQEKDEa
Consensus TSVESIEETD GYSASWTVP? GARAYVEMMQ C QAEKZ?
Conphys TSVESIEETD GYSASWVVPF GARAYVEMMQ C QAEKEP
451 500

A. terreus 9A-1 LVRVLVMDRV M?LKGC?TDK LGRCKrOArV A.GL5 FAQAGG KWADCc-
A. terreus cbs LVRVLVMDRV M9LHGCAVDN LGRCKrDDFV EGL5 "ARAGG NWAECc
A. niger var. awamori LVRVLVMDRV VPLKGCPIDa LGRCTrOSFV rGLSFARSGG DWA£CsA
A. niger T213 LVRVLVNDRV VPLHGCPIDa LGRCTrDSEV rGLSFARSGG DWASCFA
A. niger NRRL3135 LVRVLVNDRV VPLHGCPVDa LGRCTrOSFV rGLSFARSGG DWAZCFA
A. fumigatus 13073 LVRALINORV VPLKGCDVDK LGRCKLMDFV KGLSWARSGG KWGECFS
A. fumigatus 32722 LVRALMDRV VPLHGCDVDK LGRCKLMDFV KGLSWARSGG NWGECFS
A. fumigatus 58128 LVRALIHORV VPLKGCDVDK LGRCKLMDFV KGLSWARSGG NWGECFS
A. fumigatus 26906 LVRALINDRV VPLKGCDVDK LGRCKLMDFV KGLSWARSGG MWGECFS
A. fumigatus 32239 LVRALINDRV VPLHGCAVDK LGRCKLKDFV KGLSWARSGG NSEQSFS
A. nidulans LVRVLVMDRV VPLHGCAVDK FGRCTLDDWV EGLNFARSGG NWkTCFTl
T. thermophilus WRVLVMDRV VrLHGCEVDS LGRCKrDDFV rGLSFARcGG NWEGCYAa.se
M. thermophila MVRVLVNDRV MTLkGCGADE rGMCTLErFI ESMAFARGNG KWD1CFA
Consensus LVRVLVNDRV VPLHGCAVDK LGRCK-DDFV EGLSFARSGG NWAZCFA
Conphys LVRVLVNDRV VPZJiGCAVDK LGRCKRDDFV EGLSFARSGG NWAZCFA
CP-1

TATATGAACT^TCGGCGXGXTCGXCGTGCTACTGTCCATTGCCACCXXGTTCGGTTCCA 
1 h + ^ + + + ^ 

ATATACTmGTACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGT 

CATCCGGTACCGCCXTGGGTCCXCGTGGTAATTCTCACTCTTGTGACACTGTTGACGGTG 

Si , 

+ ^ + + + t* 120 

GTAGGCCAXGGCGGAACCOVGGAGCACCA^ 
CP-2 

CP-3 

GTTACCAAXGXXXCCCAGAAAXXXCXCACXXGXGGGGXCAAEACXCXCCA^ 
121 , + „ , + + + lfl0 

CAAXGGXXACAAAGGGTCXXTAAAGAGXGAACACCCCA.GTTATGAGAGGTATGAAGAGAA 

XGGAAGACGAAXCXGCXAXTXCXCCAGACGTTCCAGACGACTGTAGAGTTACTTTCGTTC 
181 + + .._.„ + + + 24Q 

ACCTTCTGCTTAGACGMIAAAjGAGGXCXGCAAjGGTCXGCX 
CP-4 

CP-5 

AAGTXXXGTCTAGACACGGXGCIAGAXACCCAAGXXCXTCTAAXSICXAA^ 
241 + + _ + „„ + .„. + + 3Q0 

XXCAAAACAia\^CXGXGCCACGAXCXAXGGGXXGAAjGAAGATTC^GATTCCGAATGAGAC 

CXXTGAXXGAAGCXAXXCAAAAGAACGCIACIGCXIXCAAGGGXAAGTACGCTTXCTTGA 
301 + __ + + + + _ + 3g0 

GAAACXAACTTCGATAAGXXXTCXXGCGAXGACGAAAGXXCCCAXXCAXGCGAAAGAACX 

CP-6 

CP-7 

AGACrTACAACTACACXXTGGGXGCXGACGACXXGACXCCAXXCGGXGAAAACCAAAXGG 
361 + + + + + + 42Q 

XCXGAAXGXXGAXGXGAAACCCACGACXGCXGAACXGAGGXAAGCCACXXXXGGTXTACC 

XXAACXCXGGXAXXAAGXXCXACAGAAGAXACAAGGCXXXGX^XAGAAAGAXXGXXCCAX 
421 + + + + + f ^ 48Q 

AAXXGAGACCAXAATTCAAGATGXCTXCXAXGTTCCGAAACCGAICXXXCXAACAAGGXA 

CP-8 

CP-9 

TCATTAGAGCTTCTGGTTCTGACAGAGTTATTGCTTCTGCXGAAAAGXXCAXXGAAGGXX 
481 -H" + + + + + 540 

AGXAAXCXCGAAGACCAAGACXGXCXCAAXAACGAAGACGACXXXXOVAGXAACXXCCAA 
TCCAATCTGCXAAGXXGGCXGACCCAGGXICXCAACCACACCAAGCIXCXCCAGXXAXXG

541 -r h -r + + + 600 

AG G TT AG AC GAT T C AAC CGACTG GGT C O AAG AGT TGGTGTGGTTC GAAGAGG X CAAT AAC 

CP-10 

CP-11 

ACGXXATTATTCCAGAAGGaTCcGGTTACAACAACACTTTGGACCACGGTACTXGXACIG 

601 + + + -r + -r 660 

TGCAAXAAXAAGGXCXXCCtJ^COVAXGXXGXXGXGAAACCXGGXGCCAIGAA^ 

CXTXCGAAGACXCXGAAXTGGGXGACGACGXXGAAGCTAACTTCACXGCXXXGTTCGCXC 

6S1 + + + + + + 720 

GAAAGCXXCXGAGACTTAACCCA.CTGCTGCAACTTCGATTGAAGTGA.CGAAACAAGCGAG 

CP-12 



CAGCXAXTAGAGCXAGATTGGAJIGCTGACTTGCCAGGTGTTACTTTGACTGACGAAGACG 

721 + + + +- 780 

GTCGAOLAATCXCGATCTAACCXTCGACTGAACGGTCCACAATGAAACTGACTGCXXCTGC 

CP-13 

XTGXTTACXXGAXGGACAXGXGXCCAXTCGAAACXGXXGCTAGAACXTCTGACGCXACXG 

781 + + + + + + 840 

AACAAAIGAACXACCXGXACAC AGGTAAGCT TTGA.CAACGATCTTGAAG ACT G C GAT GAC 

AAXXGTCXCCAIXCTGXGCXTXGXTCACTCACGACGAATGGAGACAATACGA.CTACTTGC 

841 + + 4- * + + 9°° 

TTAACAGAGGXAAGACACGAAACAAGXGAGXGCXGCXTACCXCX 
CP-14 

CP-15 

AATCTTTGGGXAAGXACTACGGXTACGGXGCXGGXAACCCAXXGGGICCAGCTCAAGGXG 

901 + + + + + + 960 

XIAGAAACCCAJTCAIGAIGCCAAXGCCAGGACCATTGGGTAACCCAGGTCGAGTTCCAC 

XXGGXTTCGCXAACGAAXTGAXXGCTAGAXXGACXAGATCTCCAGTTCAAGACCACACTT 

961 + + + + + + 1020 

AACCAAAGCGATTGCXXAACIAACGAXCXAACXGAXCXAGAGGTCAAGXXCXGGXGXGAA 

CP-IS 

CP-17 

CTACTAACCACACTTTGGACXCXAACCCAGCTACTXXCCCAXXGAACGCTACXXXGTACG 

1021 r — + * + + 1080 

GAX GAXXGGTGT GAAACC EGAGAT X GGGXCGAXGAAAGGGTAACT TG CGAT G.AAAC AT G C 



Pin QF 
CXGACrTCTCTCACGACAACTCTATGAXTTCTATTXTCITCGCTTTGGGTTTGTACAACG

ioai + . 

+ + + +■ 1140 

GACTGAAGAGAGTGCTGTTGAGATACEAAAGAIAA^ 

CP- 18 

CP-19 

GTACTGCTCCATTGTCTACTACTTCTGTTGAA^CTMXGAAGAAACTGACGGTTACTCXG 

1141 + , . ^ ^ ,„ rtrt 

+ + + + + i2QQ 

CATGACGAGGTAACAiSAXGAXGAAGACAACXXA^ 

CXXCXTGGACXGXXCCAIXCGGXGCOMAGCXXACGXXGAAAXG^XGCAAXGXC^ 

1201 + + , ■ . ^ ^ 

+ + + + - + 1260 

GAAGAACCTGACAAGGTAAGCCACGAXCTCGAATGCAACXXtCACTAGGXXAGAGXXCGA^ 

CP-20 

CP-21 

AAAAGGAACCAXTGGTTAGAGTXTTGGTTAACGACAGAGTXGXXCCAXXGCACGGXXGXG 
1261 _ + + + + + 1 132Q 

XXXXCCTTGGIAACCAAXCXCAAAACCAAXXGCXGXCXCAACAAGGXAACGXGCC^^ 

CXGXXGACAAGXXGGGXAGAXGXAAGAGAGACGACXXCGXXGAAGGIXXGXCXIXCGCXA 
1321 . . + + + _ + 138Q 

GAC^CTGXXCAACCCATCTACATTCTCTCTGCTGAAGCA^^ 

CP-22 



GAXCXGGTGGTAACTGGGCTGAAXGTXXCGCTTAAGAAXTCATAXA 
+ 4- + + 

CXAGACCAGCAXXGACCCGAGXXACAAAGCGAAXXCXXAAGXATAX 
1 TCTGTAACCGATAGCGGACCCACTAGGC^^ SQ

SI GAC\ATGCAACTCAGTCGAATATGAAGGGCTA^ 1QQ 

X 0 1 «jCCGTCTAGGTCGSGCrcCGGG^ • 150 

XSL TTCGw i CAXGGv- j. ' L V i ' L"rC^CGGTCGCTCTTrCGCTTTAXTACIlLGCr AT 200 

MAFFTVAL SLYYLLS 15 

201. CGAGgugagacccccacaacacccgcccgcccagcccaaccggcacccac 250 

R IS 

25 1 CCgcacagACTCTCTCXnCACGCCCCAOTOGrCCA^ 300 

VS AQ APVV'QMHSCN 30 

301 TACGGCGGACGGTGGATATCAATGCTTCCCCAATGTCTCTC^ 350 



TADGGV. QC F PtfVSKVWG 



47 



35! GTCA^TACTCGCCGrACTrcrCCATCGAGCAGGAGTCAGCTATCrCT 400 
Q^SPY FSI£Q£SAISE fi3 



401 GAGGTGCCTCATGGCTGTGASGTTAC C jl V1U TGCAGGTGCT C TCGCGGCA 450 
DVPKGCEVTFVQ VLSRK 80 



4 5 L TGsiuGCTACOTATCaSACAGAGrro 500 
GARYPTES KS 2C A Y S G L I 37 



SQL TTGAAGCAATCCAGAAGAATGCTACCTi L 1 1 11 IUG GGACAGTATG L I ' l IT 550 

EAIQKNAT S FWGQYAF 113 

• • • • . 

S 5 1 CTGGAGAGTTATAACTATACCCTCGGCGCGGATGACTTGACT A I L 11L GG 500 

I. 2SYNYTL GA DDLTI FG 130 

♦ 

S Q 1 CGAGAACCAGATGGTTCAirCGGGTGCCAAGTTCTACCGACGGTATAAGA 650 

SfcfQMVDSGAK F Y R R Y K N 147 

• • • • . 

S3 1 ATCTCGCCAGGAAAAATACg O.1 1 1 1 A TCCGTCGVrCAGGGTCTGACCGT 700 

tARKNTPFIRAS-OSDR 1S3 
• • • •

701 GTCGTTGCGTC7GCGGAGAACrrLC\TT^ 750 

VVA.SAEKFINGFRKAQL 130 

• • • • 

7Si CQCGACcaiGGcrccaAAastGciACGCca b l *u l cutctgattatcc a a a 

KDKGSXRATPVVMVII? 197 



* • • • • 

a a i CTOAAATOSAiGsnrrjuca^^ a s a 

E IDG FNKTL DKSTCVS 213 



flSl TTTGAGAATGATGACCCXKC^ 900 
FEtfDE'SiAD E I E A N F T A I 230 



• • • • * 

901 TATO3GACCTCCGATCCGakAACGT 350* 
MGPPIRKRLEtfDLPGIK 247 



• • • • • 

351 AACTTAaVAACGAGAATCT^ 1000 
LTNENVXY&MDttCS F D 2S3 



• • ■ • • 

TMARTAKGTELSPFCAI 280 



• • • • • 

"SI CTT^CTGAAAA^ACTT TO^ rATCAA 1100 

FTEKEWLQ YD VLQS LSK 297 

• . • — • • • 

nai ^r^sn^GGCTACcxrrGcax^^^ nso 

rYGVGACS PLGPAQGI 313 

1151 GGCrrCACCAACGAGCIGAriGCCCCACTAAGGCAATCGCCCGTCCAGGA 1200 

GFTtfEL IARLTQSPVQD 330 

1201 C^CACVAGCACCAACCACACTCT 12 SO 

NTSTNKTLDSNPATFPI* 347 



• • • • * 

12S1 TCGACAGGAAGCTCTACGCCGA L. l i C L L CCACGACAATAGCATGArATCG 13 00 

DRKL1fADF.SKD6tJSK.gIS 3 53 

* • . • 

13 01 ATATtCrrCGCCATGGGTCrGTA^ 13 30 

I FFAMG LYNGT QPL SMD 330 
1351 TTCCGTGGAGTCGATCCAGGAGATGGACGGTTACGCGGCGTC^ 1400
SVESI Q EM D G.YAASWTV 337 



14 Q I TTCCG i j. Llru tUCGAGGGCTTAL t V ^UAGCTCATGCAGTGCGAGAAGAAJS 1450 
P ? G A R A Y F ELMQCESCX 413 



1451 GAGCCCl j. lUlliCGGGTATTAGTGAATGArCGC C l iG i. i CCTCTTCATGG 1500 
EPLVH.VI.VNDRVVpr.KG 43 0 



15 0 L CTGCGCAGTTGACAAG * j. i.OGACGGTGCACI L L GGACGATTGGGT.^GAGG 155 0 
CAVDK?.G**R CTLDDWVEG 447 



1551 GCXTGAnTTXTGCAAGGAGCGGCGGGAACrGGArtGACTTGITTTACCCrA Iff 0 0 
LNFARSGG NWKTCFTL 463 



IS 01 TAAAGGGC G i.uuv,i LA TTCAXAA G 1 1G lG CAGGTATAGGAAGGTTAG Iff 50 

1 S 5 1 GGAATTAG CVu 1 j> iliGvl j. i lA GrCTTATTAGACCftACAArG Ai I xU 1 1700 

1701 VrClCAAGGCCrTCTAGCATATCGTCAAGTGGG^ L750 

• • • , • • • 

1751 C A T G T G iA GCTGAACCCC C il! iU CATCIACCT Ljl ib iC l 1 1LA GAGTAG 18 0 0 

1801 TTXC^CCAAACATATCCrCGXGrCCTCr Cl iC £G CTCITCGGTCrCATAT 1850 

V • • • * 

18 51 TACA C ili k I CTCTATCTArATCGTCAACAAAACTACCACCCAAACACCAA 1900 
1301 ATGTCACACTkTCCAGCACGAA Ar 1 111 iC C 1931 
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1 ATCmiCTCTCTCill'bTlVia iT^^ 
23MCVSA VLLP LTLLSGVTSQL 



61 GC AGTCCCCGCCTC GAGAMTCMTCCAGTTGCQATACGGTCGATCAGGGGTATC AATGC 

AVPASRNQSS'CDTVDQOYQC 
-1*1 

121 TTCTCCGAGACnTCGCATCTTltSGGGTCAATACGCACC^^ 
18 FSETS HLWGQYAPPFSLANE 



181- TCGGTCATCTCCCCTQAGGrQCCCGCCGGATGCAGAGTCACrnX!GCTCACGTC 
38 SVIS PEVPA GCHVTFAQV LS 



2*fl CGTCATGOACfcGCGGTATCCaACCGACTCCA^ 
58RHGAHTPTPSKGKXYSALIE 



301 GAGATK5AGCAGAACGCGACCACCTXTGACGGAAAATATGCCT 
78EIQQM.ATTPD GKY.AFLKTYH 



361 TACAGCTTGGQnnCKAGATGAariXUCTCCCTTCQGAQMCAGC^ 
90YSLGADDZ.TPFGBQSLVNSG 



421 ATCAAQTTCTACWAGCGOTACaAATCGCTCACAAGaAACATCG^ 
118IKPYQ HYB.SLTRNIVPPIRS 



J81 TCIGGCTCGAGCCGCGTGATCGCCTCCGGCAAGAAAT^ 

138SGSSRVIASGKKFIEGFQST 



5^1 AAGCTGAAGaATCCXCGTGCCCAGCCCGGCCAATCGTCGCCCAAGATCGAC^ 

158 KLK'DPRAQPGQSSPKXDVVZ 



601 TCCQAGGCCAGCTCATCCMCAACACTCnCQACCCAGQCACCTQCACTI^^ 

178 SEASSSKHTLDPGTCTVPED 



661 AGCGMlTGGCCGATACCGlXJQAAGCCAATITCACCGCCACGTrCGTCCCCTCCATTCCT 
198 S E L A VBANFTATFVPSIR 



721 CAACGTCTCGAGAACCLACCriXr^ 

218 QRLBNDL SGVTLTDTEVTYL 



781 ATGQACATGTGCTCCTTCGACACCATCTCCACCAGCACCGTCGACACCAAGCTGTC 
238 MDHCSPD TISTSTVDTKLSP 
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84l TTCTQTGACCTGITCACCCATGACGAATGQATCAACTACGACTACCTCCAGTCCTTGAAA 
258 FCDLFTHDEW I NYDYLQSLK 



901 AAGTATTACGGCCATGGTGCAGGTMCCCGCTCGGCCCGACCCAGGGCGTCGGCTACGCT 
278 KYYGHGAGNPLGPTQGVGYA 



961 AACGAGCTCATCGCCCGTCTGACCCACltJGCCrGTCCACGATGACACCAGTrCCAACCAC 
298 NE LIARLTHSPVHDDTSSNH 



1021 ACTrrGGACTCGAGCCCGGCTACCTTTCCGCrrcAACTC^^ 
318 TLDSS ? A T F P I.KST LYADFS 



1081 CATGACAACCKSCATCATCTCGATECTCTTTGCTTTAGGTCTG^A 
338 HDNGI I S'l-L F A EG L'T N GTK P 



11^1 . CTATCTACCACGACCGTGGAGAATATCACCCAGACAGATGGATTCT 
358 LSTTTVE NITQTDGFiSSAWT 



1201 GTItlCGITnxrCTCGCGTITGrACGTCGAGA 
378 VPFAS B LYVE MM QCQAEQEP 



1261 CTGCrrCCGTGTCTTGGTTAATGATCGCGTTGTC 
398 LVRVLVNDRVVPLHGCPVDA 



1321 TIGGGGAGATGTACCCGGGATACOTITGTGAGGGGarrG^ 
4l8 LGRCTRDSFVRGLSFARSGG 



1381 GATTGGGCGGAGTGTTTTGCTTAG 
^38DWAECFA» 
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tctagaacaataacaggtactccctaggtaccegaaggaccttgtggaaaatgtatggag 60 

gtggacacggcaccaaccaccacccgcgatggcgcacgtggtgccctaaccccttgctcc 120 

ctcaggatggaatccafcgtcgactctfctaeccteaccatcgcctggatgaaacctccccg 180 

etaagctcacgacgatcgctatttccgaccgatttgaccgtcatggtggagggctgattc 240 

ggtcgatgctcctgccttcatntcggagtitcggagacatgaaaggcttatatgaggacgt 300 

cccaggtcggggacgaaatccgccctgggctgtgctccttcgtcggaaacatctgctgtc 360 

cgtgatggctaccatgggctttcttgccattgtgctctccgtcgccttgctctttagaag 420 

M G F L A I VIi'S V A L L F R S 16 

gtatgcacccctctacgtccaattctctgggcactgacaacggcgcagcacatcgggcac 480 

T S 6 T 20 

cccgttgggcccccggggcaaacatagcgactgcaactcagtcgatcacggctatcaatg 540 

PLGPRGKHS D CN SVDHGYQC 40 

ctttcctgaactctctcataaatggggactctacgcgccctacttctccctccaggacga 600 
FPELSHKWGLYAPYFSLQDE 60 

gtctccgtftcctctggacgtcccagaggactgtcacatcaccttcgtgcaggtgctggc 660 
S P F P L D V P E . D C H I T FVQVL A 80 

ccgccacggcgcgcggagcccaacccatagcaagaccaaggcgtacgcggcgaccattgc 720 
RHGARSPTH S KT KAYAAT IA 100 

ggccatccagaagagtgccactgcgtttccgggcaaatacgcgttcctgcagtcatataa 780 
AXQKSATAFPGKYAFX.QSYN 120 

ctactccttggactctgaggagctgactcccttcgggcggaaccagctgcgagatctggg 840 
YSLDSEELTPFGRNQLRDLG 140 

cgcccagtrtctacgagcgctacaacgccctcacccgacacateaaccccttcgtccgcgc 900 
AQFYERYNALT RHINPFVRA 160 

caccgatgcatcccgcgtccacgaatccgccgagaagttcgtcgagggcttccaaaccgc 960 
TDASRVHESAEKFVEGFQTA 180 

tcgacaggacgatcatcacgccaatccccaccagccttcgcctcgcgtggacgtggccat;^ 1020 
RQDDHHANP HQ P SPRVDVA I 200 

ccccgaaggcagcgcctacaacaacacgctggagcacagcctctgcaccgccttcgaatc 1080 
PEGSAYNNT L E H S1CT AF ES 220 

cagcaccgtcggcgacgacgcggtcgccaacttcaccgccgtgttcgcgccggcgatcgc 1140 
S X VGD DAVA N F T AV F A P A X A 240 

ccagcgcctggaggccgatcttcccggcgtgcagctgtccaccgacgacgtggtcaacct 1200 
QRLEADLP G V.Q L S TD DVVNL 260 

ga-tggccatgtgtccgttcgagacggtcagcctgaccgacgacgcgcacacgctgt cgcc 12 60 
HAMCPFETVS I*TDDAHTLSP 280 

gttctgcgacctcttcacggccactgagtggacgcagtacaactacctgctctcgctgga 1320 
FCDLFTATEWTQYNYLLSLD 300 

caagtactacggctacggcgggggcaatccgctgggtccggtgcagggggtcggctgggc 1380 
XYYGYGGGNP LGPVQG VGWA 320 

gaacgagctgatggcgcggctaacgcgcgcccccgtgcacgaccacacctgcgtcaacaa 1440 
NELMARLTRA P VH DHTCVNN 340 

caccctcgacgcgagtccggccaccttcccgctgaacgccaccctctacgccgacttctc 1500 
^LDASPATFPIiNATLYADFS 360 
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ccacgacagcaacctggtgtcgatcttctgggcgctgggcctgrtacaacggcaccgcgcc IS CO 
HDSNLVSIFWALGL.YNGT.AP 380 

gctgtcgcagacctccgtcgagagcgtctcccagacggacgggtacgccgccgcctggac 1620 
LSQTSVESVSQTDGYAAAWT 400 

ggtgccgttcgccgctcgcgegtacgtcgagatgatgcagtgtcgcgccgagaaggagcc 1680 
VPFAARAYVEMH QCRAEKEP 420 

gctggtgcgcgtigctiggtcaacgaccggg^catgccgctgcatggctgccctacggacaa 1740 
IiVRVLVNDRVMPIjHGCPTDK 440 

gctggggcggtgcaagcgggacgctttcgtcgcggggctgagctttgcgcaggcgggcgg 1800 
LGRCKRDAFVAGLSFAQAGG 4E0 

gaactgggcggattgtttctgatgttgagaagaaaggtagatagataggtagtacatatg 18 SO 
H W A D C F 4ff & 

gattgctcggctctgggtcgttgcccacaatgcatattacgcccgtcaactgccttgcgc 1920 
catccacctctcaccctggacgcaaccgagcggtctaccctgcacacggcttccaccgcg 1980 
acgcgcacggataaggcgcttttgttacggggttggggctgggggcagccggagccggag 2040 
agagagaccagcgtgaaaaacgacagaacatagatatcaattcgacgccaattcatgeag 2100 
agtagtatacagacgaactgaaacaaacacatcacttccctcgctcctctcctgtagaag 21S0 
acgctcccaccagccgcttctggcccttattcccgtacgcraggtagaccagtcagccag 2220 
acgcatgcctcacaagaacgggggcgggggacacactccgctcgtacagcacccacgacg 2280 
tgtacaggaaaaccggcagcgccacaatcgtcgagagccatctgcag 2327 



12B 



WO 99/49022 



PCT/DK99/00153 



39/51 



si Aactraccraicrra^^ iao 
iai TOieocwiaaac^^ 1S0 

. 2Qi oeTTOacaictosciG^ 23Q 

M S X. £, L S 

301 *Gfl^C*CTC^^ 

=> c ^" caca =C^CCCccgccaacgcccccacaacc?aasrcrrcrCAAGAA 400 

V S ft M 20 

401 ATCCGCAXCrrrGATAGCCACrCTTCCAATA^^ 4S0 
P "VDSHSCNTVEGGYQ 3S 

CRP *ISKSWCQYSPFFS 53 

SOI C^CAGACCSGTCaacilTOGCCAG^CXCCC^ SS0 
^ADQSEISP-DVPQNCKI 70 

551 TTAC^ , , t-CAGCTCC U Ll_ L LL, A CACGGCGCTAGATACCCTACGrCT S00 

~ pvqlls r kgarypts ae 

5 KTE LYSQL ISRIQ KTA 103 

^51 GACTGCGTACAAAGGCTACTATGCCTTCTTCAAAGACTACAGATA 700 

'.AYKCYYAFLKDYRYQL 120 

701 TGGGAGOGAAOTACCTGACGCCCTITCmGGAAAACCAGATGATCCAGTTG 750 

GAMDLT??GSNQMIQC 13 



Fig. 13A 
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• • • • • ■ 

751 GGCVTCAAU u 1 1 1 A7AACCATTACAAGAGTC7CGCCAGGAATGCCGTCCC 8 0 0 
C IKFYNKYKSLARHAVP 153 



• • • • • 

B Q 1 ATrCGTrCGrrGCTCCGGCTCTGASCGGGrCATro 8 50 

FVRCSGSO R V I A S G R L F 170 



• • • • • 

8 S X TCATCGAAGGTTTCCaGAGCGCCAAAGTC 300 

I E G F Q S AKV LD ?K.S D K iaS 

• • * * • 

301 CATGACGCTCCTCCC^CGATC^ACGItaAT^ 350 

HDAPFTINVIIEEGPSY 203 



• • • • • 

351 CAATAACACGCTCGACACCGGC^GCrGTCCAGTL i l i GA GGACAGCAGCG 1000 
MNTLDTGSC PVFEDSSG 220 



• • • • • 

1001 GGGGACATGACGCACAGGAAAAGTTOT 1050 

GKDAQEKFAKQFAPAI 23S 

• • • * • 

105 1 CTGGAAAAGATCAAGGACCATCITCCCGCCGTGGA^ 110 0 

I* E K I ' K D KLP GVDLAVSD 253 



• • • • 

1101 TGTACCGTACTTGATGGAl. 1 IL, l GTCC l; l l ilG AnACCTTCGCTCGCAACC 1 1 S 0 
* VPVLMDLC5 FETLARWK 270" 



• • • • » 

US! ACA&AGACACGCTCTCTCC Cr I il L ( jCC^r rC TTTCCACGC AAG AGCAGTOG 1200 
TDTLS P FCALSTQ EEW 285 



• • « • • 

1201 C AAG CAT ATG ACT ACT A C CIA^GTCTGGCG AAAT ACT ATGG CAATGGCGG 1250 

jQAYDYYQSLGKYYGMGC 303 

• ♦ 

1251 GGGTAACCCGTTGGGGCCAGCCCAAGGCGTGGGGTTTGTCAACGAGTTGA 1300 

GNPLGSAQGVGFVMELr 320 

1 3 C 1 TTGCTCGCATGACCCATAGCCCTGTCCAGGACTACACC\CGGTCAACCAC 13 50 

A R M T K S ? V Q D Y T T ' V M K 3 35 

• • • • ■ 

1351 ACTCTTGACTCGAATCCC<jCGACATTCCCTTTGAACGCGACGCTGTACGC 1 * 0 0 

TLDSUPATF PLWA7LYA 353 
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"01 AGATTTCAGCCACCACAACAC^^ 14SQ 

OFSKDNTMTS IFAALGL 370 

• . . 

" 51 TCTACAACGGGACCGCGAACCX^^ 1SQQ 

YWGTAKLSTT2IKSIE 33S 

ETDGYSAAWTV PFGGRA 403 

* * * • 

ISSX CTATATCGAGATGAZGCAGTGTGATGATTCGGATGAGCCAGTCGT7CGGG 1500 

VIBMMQ CD 0 SDEPVVRV 420 

"01 TGCTGGTCAACGACCGGGTGGTTCCACTGCATGGCTGCGAGGTGGACrCC 1S50 

LVJTDR VVPLHGCEVDS 43S 

Iff 5 1 CTCGGGCGATGCAAfl^lAGACGACTTtOT 17 00 

LGRCKRDDFVRGI.S?AR 4S3 



1701 ACAGG3TGGGAACIGGGAGGGGItnTACG C UK! I fcCiXj AGTAGGTTTATT 17S0 
QGGNW E GC VAASS » 4Sfi 

1751 CP^GACTrrCGACCXTTCrATCCTTCAAACACrGCACAAAGAC^ 1800 

• • . . 

1301 ^TOAAATGGTAACAGGCCTGGAGCGTITEVGAAGGAAAAAAGTr 18 43 
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A&SSIGGGCAAACTCATCATGCTC^TCTTGATGATTCCIACTGTTCMCTACCTGGCTGCTGCTTCTCTi5IGGGTTl 
8 0 - 

Hindlll M L I L M I P Ii FS Y L A A A S L 



CATC 



CTTTGCCCCTOTCTCGATGTTAAAATACTAAACATATTTCACCAGAOGTGTA 
•Lou ~™ 

RVLSPQPVSCD 

GCCCGGAGCTO^ 

SPELGYQCDQQTTHTWG'QYSPPFSVPS 
^^TCTCCCCTTCCGTTC 

EISPSVPDGCRLTFAQVLSRHGARPPT 

CCCGGGTAAAGCCGCCGCCATCTCCGCTGTCCTCACCAAAATCAAAACCTCTGCCACCT 
400 



PGKAA AI SAVLTKI KTSAT 



Y G S D P Q 



TCATCAAGAACTACGACTATGTACTTGGCGTAGACCACCT 
480 

F I KNYDYVLGVDH LTAFGEQEMVWS GI 

AAGTTCTACCAGCGCTACTCCTCCCTCATCCAGACAGAAGACrrCGGAT^ 
560 



KFYQRYSSL 



SDTLPFVRASGQE 



ACGCGTCATCGCCTCCGCCGAGAACTTCAC^ 
640 

RVIASAENFTTGFYSALSADKNPPSS 

TACCAAG^CCAGAAATGGTCATCATTTCTGAGGAGCCAAC^ 
720 

IiP RPEMVIISEE PTANNTMHHGLCRSF 

GAAGATTCCL\CCACCGGCGACCAAGCCCAAGCGGAATTCATCGCCGCCACCTTCCCACCCATCACC^ 
800 

EDSTTGDQAQAEFIAATFPPI TARLNA 

CCy^GGTTTCAAAGGCGTCACCCTCTCCAACACCGACGTC 
880 

Q GFKGVTLSNTDVLS LMDLCPFD TVA 

ACCCCCTTTCCTCCCTCACCACCACCTCTTCCGTTTCTGGAGGCGGCA^ 
960 

YPLSS LTTTS SVSGGGKLS PFCS L FTA 

AGCGACTGGACAATCTACGATTACCTCCAGTCCCTAGGGAAAT^ 
1040 

SDWTIYDYLQSIiGKYYGFGPGNSLAAT 

CCAGGGGGTAGGGTACGTCAACGAGCTTATCGCCTO^ 
1120 

QGVGYVNELIARLI RAPVVDHTTTNS 

CTCTOGATGGCGACGAAAAAACGTTTCCGTTGAACAGAACGGTGTATG 
1200 

TLDGDEKTFPLNRTVYADFSHDNDM MN 
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ATCCTGACTGCTTTGCGGATATTCGAGCATATCAGTC 
1280 

I LTALRIFEHI S PMDNTTI PTNYGQTG 

AGATGACGGGGTGAAGGAAAGGGATTTGTTCAAGGTTAGTTGGGCGGTGCCCTTTGCTGGGAGGGTGTACTTT 
1360 

DDGVKER DLFKVSWAVPPAGRVYFEK 

TGGTTTGTGATGCGGATGGGGATGGCAAGATTGATAGTGATGAGGCT^^ 
1400 

MVCDADGDGKID SDEAQKELVRILVND 

CGGGTGATGAGATTGAATGGGTGTGATGCTGATG^CAGGGTAGGTGTGGATTGGAGAAGTTTCT 
1520 

RVMRLNGCDADE QGRCGLEKFVESMEF 

TGCGAGGAGAGGGGGGGAGTGGGAGGAGAGGTGTTTTGTTTAG CTCTAGA 
ARRGGEWE ERCFV Xbal 
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1 . •MLILMIPLFSYLAAASLRVLSPQPVSCDSPELGYQCDQQTTHTWGQYS 48 

1 III. |. . I I . I : I I : I I : . . . | | I | | | 

1 MTGLGVMVVMVGFLAIASLQSESR. . . PCDTPDLGFQCGTAISHFWGQYS 47 
• 

49 PFFSVPSEISPSVPDGCRLTFAQVLSRHGARFPTPGKAAAISAVLTKIKT 98 

|!lMII|:.:|:||:| : I I I I I I I I I I I I II :||.. 
4 8 P YFS VPS ELDAS I PDDCEVTFAQVLS RHGARAPTLKRAAS YVDLI DRI HH 97 
• 

99 SATWYGSDFQFIKNYDYVLGVDHLTAFGEQEMVNSGIKFYQRYSSLIQTE 148 

:| --H.:::|::.|||-i|.|.|| I : I : I I I I I I I I I . ! I . . I 
98 GAISYGPGYEFLRTYDYTLGADELTRTGQQQMVNSGIKFYRRYRAL 143 

14 9 DSDTLPFVRASGQERVIASAENF.TTGFYSALSADKNPPSSLPRP . EMVII 1 97 

: I I I I . . I I : II : I I I I I I I I . I I I I I : . • I : I | : | 

14 4 ARKS I PFVRTAGQDRWHSAENFTQGFHSALLADRGSTVRPTLPYDMWI 193 
• 

198 SEEPTANNTMHHGLCRSFED. . . STTGDQAQAEFIAATFPPITARLNAQG 244 

•l- : -IN|:|::|| .||: I I . | | : | | . . : : . . . |||||:|| . 
194 PETAGANNTLHNDLCTAFEEGPYSTIGDDAQDTYLSTFAGPITARVNA.N 242 

245 FKGVTLSNTDVLSLMDLCPFDTVAYPLSSLTTTSSVSGGGK.LSPFCSLF 293 

: * I • • I • : • I • : • I I M I I I : I I I . I . . ! : I . I : I I I I I * I I 

243 L PGANL T DADT VALMDLC P FET VAS S S S DP ATADAGGGNGRPLS P FCRL F 292 

294 TASDWTIYDYLQSLGKYYGFGPGNSLAATQGVGYVNELIARLIRAPWDH 343 

-> : l M II | |||:| || . | | J 

293 SESEWRAYDYLQSVGKWYGYGPGNPLGPTQGVGFVNELL71RLAGVPVRDG 342 

* 

344 TTTNSTLDGDEKTFPLNRTVYADFSHDNDMMNILTALRI FEHISPMDNTT 393 

1*1 M Ml I M.I.MI I I II I I I I I .:| . || :: :.|:| 
34 3 TSTNRTLDGDPRTFPLGRPLYADFSHDNDMMGVLGALGAYDGVPPLD . . . 389 

• • • 

394 I PTNYGQTGDDGVKERDLFKVSWAVPFAGRVYFEKMVCDADGDGKIDSD . 442 

• I 5 s • - I : : • I I I I II I : I : I . I I I I . : : I : I : : : 
390 KTARRDPEELGGYAASWAVPFAARIYVEKMRCSGGGGGGGGGEG 433 

4 43 . . EAQECELVRILVNDRVMRLNGCDADEQGRCGLEKFVESMEFARRGGEWE 490 

I :-l:M:IIIMM I . I I : I I I . I . I . I I : I : I I I . I I I .|.|: 
434 RQEKDEEMVRVLVNDRVMTLKGCGADERGMCTLERFIESMAFARGNGKWD 483 



4 91 ERCFV 495 
II. 

484 L.CFA 487 
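
The pairwise alignment above pairs each residue of one phytase with a residue or a gap (`.`) of the other. As a rough illustration of how such an alignment can be summarised, the sketch below computes a percent identity over the gap-free columns. The function name `percent_identity` and the choice of denominator are assumptions made for illustration only; alignment programs differ in how they define identity, and this is not necessarily the calculation behind the figure.

```python
def percent_identity(row_a, row_b, gap="."):
    """Percent of gap-free alignment columns holding identical residues.

    row_a, row_b: equal-length aligned strings, with `gap` marking gaps.
    Columns in which either row has a gap are ignored (an assumed convention).
    """
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have the same length")
    aligned = 0
    identical = 0
    for a, b in zip(row_a, row_b):
        if a == gap or b == gap:
            continue  # skip columns that contain a gap
        aligned += 1
        if a == b:
            identical += 1
    if aligned == 0:
        return 0.0
    return 100.0 * identical / aligned


# Final block of the alignment above: residues 491-495 against 484-487.
last_block = percent_identity("ERCFV", "L.CFA")
```

Applied to the whole alignment, the rows of each block would first be concatenated into two full-length strings before calling the function.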
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Peniophora numbers 1 

Alignment numbers 1 

P_involtus_A1 ML FGFVALACLL
P_involtus_A2 MH LGFVTLACLI
T_pubescens MAFSILASLL
A_pediades MSLFIGGCLL
P_lycii MV SSAFAPSILL
A_fumigatus MVTL TFLLSAAYLL
consphyA MGVF VVLLSIATLF
A_nidulans MAFF TVALSLYYLL
A_ficuum_NRRL3135 MGVS AVLLPLYLLS
A_terreus MGFL AIVLSVALLF
T_thermo MSLL LLVLSGGLVA
T_lanuginosa MAGIGLGSFL VLLLQFSALL
M_thermophila MTGL GVMVVMVGFL
C_foecundissimum ML ILMIPLFSYL



37 
50 

SLSEVLATSV P KNT APTFPIPESE 

HLSEVFAASV P RNI APKFSIPESE 

FVCYAYARAV PRAHIPLRDT SACLDVTRDV 

VFLQASAYGG WQATFVQPF FPPQI 

SLMSSLALST QFSF V AAQLPIPAQN 

.SGRVSAAPS SAGSKSCDTV DLGYQCSPAT 
GSTSGTALGP RGNSHSCDTV DGGYQCFPEI 

. .SRVSAQAP WQNHSCNTA DGGYQCFPNV 
GVTSGLAVPA SRNQSSCDTV DQGYQCFSET 
RSTSGTPLGP RGKHSDCNSV DHGYQCFPEL 
LYVS...RNP HVDSHSCNTV EGGYQCRPEI 

TASPAIPPFW RKKHPNVD I 

AIASL QSESRPCDTP DLGFQCGTAI 

AAASL RVLSPSCDSP ELGYQCDQQT 

QPV 



P_involtus_A1
P_involtus_A2
T_pubescens
A_pediades
P_lycii
A_fumigatus
consphyA
A_nidulans
A_ficuum_NRRL3135
A_terreus
T_thermo
T_lanuginosa
M_thermophila
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QRNWSPYSPY 
QRNWSPYSPY 
QQSWSMYSPY 
QDSWAAYTPY 
TSNWGPYDPF 
SHLWGQYSPF 
SHLWGQYSPY 
SHVWGQYSPY 
SHLWGQYAPF 
SHKWGLYAPY 
SHSWGQYSPF 
ARHWGQYSPF 
SHFWGQYSPY 
THTWGQYSPF 



FPIiAEYKA. . 
FPLAEYKA. . 
FPAATYVA. . 
YPVQAYTP. . 
FPVEPYAA. . 
FSLEDELSVS 
FSLEDESAIS 
FSIEQESAIS 
FSLANESVIS 
FSLQDESPFP 
FSLADQSEIS 
FSLAEVSEIS 
FSVP. . SELD 
FSVP SEIS 



. - PPAGCQIN 
. .PPAGCEIN 
. .PPASCQIN 
. . PPKDCKIT 
. . PPEGCTVT 
SKLPKDCRIT 
PDVPDDCRVT 
EDVPHGCEVT 
PEVPAGCRVT 
LDVPEDCHIT 
PDVPQNCKIT 
PAVPKGCRVE 
ASIPDDCEVT 
PSVPDGCRLT 



QVNIIQRHGA 
QVNIIQRHGA 
QVHIIQRHGA 
QVNIIQRHGA 
QVNLIQRHGA 
LVQVLSRHGA 
FVQVLSRHGA 
FVQVLSRHGA 
FAQVLSRHGA 
FVQVLARHGA 
FVQLLSRHGA 
FVQVLSRHGA 
FAQVLSRHGA 
FAQVLSRHGA 



83 
100 

RFPTSGATTR 
RFPTSGAATR 
RFPTSGAAKR 
RFPTS GAGTR 
RWPTSGARSR 
RYPTSSKSKK 
RYPTSSKSKA 
RYPTESKSKA 
RYPTDSKGKK 
RSPTHSKTKA 
RYPTSSKTEL 
RYPTAHKSEV 
RAPTLKRAAS 
RFPTPGKAAA 



P_involtus_A1
P_involtus_A2
T_pubescens
A_pediades
P_lycii
A_fumigatus
consphyA
A_nidulans
A_ficuum_NRRL3135
A_terreus
T_thermo
T_lanuginosa
M_thermophila
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IKAGLTKLQG 
IKAGLSKLQS 
IQTAVAKLKA 
IQAAVKKLQS 
QVAAVAKIQM 
YKKLVTAIQA 
YSALIEAIQK 
YSGLIEAIQK 
YSALIEEIQQ 
YAATIAAIQK 
YSQLISRIQK 
YAELLQRIQD 
YVDLIDRIHH 
ISAVLTKIKT 



VQNFTDAKFN 
VQNFTDPKFD 
ASNYTDPLLA 
AKTYTDPRLD 
ARPFTDPKYE 
NATDFKGKFA 
NATAFKGKYA 
NATSFWGQYA 
NATTFDGKYA 
SATAFPGKYA 
TATAYKGYYA 
TATE FKGDFA 
GAISYGPGYE 
SATWYGSDFQ 



FIKSFKYDLG 
FIKSFTYDLG 
FVTNYTYSLG 
FLTNYTYTLG 
FLNDFVYKFG 
FLKTYNYTLG 
FLKTYNYTLG 
FLESYNYTLG 
FLKTYNYSLG 
FLQSYNYSLD 
FLKDYRYQLG 
FLRDYAYHLG 
FLRTYDYTLG 
FIKNYDYVLG 



NSDLVPFGAA 
TSDLVPFGAA 
QDSLVELGAT 
HDDLVPFGAL 
VADLLPFGAN 
ADDLTPFGEQ 
ADDLTPFGEN 
ADDLTIFGEN 
ADDLTPFGEQ 
SEELTPFGRN 
ANDLTPFGEN 
ADNLTRFGEE 
ADELTRTGQQ 
VDHLTAFGEQ 



133 
150 

QSFDAGQEAF 
QSFDAGLEVF 
QSSEAGQEAF 
QSSQAGEETF 
QSHQTGTDMY 
QLVNSGIKFY 
QMVNSGIKFY 
QMVDSGAKFY 
ELVNSGIKFY 
QLRDLGAQFY 
QMIQLGIKFY 
QMMESGRQFY 
QMVNSGIKFY 
EMVNSGIKFY 



P_involtus_A1
P_involtus_A2



134 176 

151 200 

ARYSKLVSKN NLPFIRADGS DRWDSATNW TAGFASA. . . ... . SHNTVQ 

ARYSKLVSSD NLPFIRSDGS DRWDTATNW TAGFASA SRNAIQ 



T_pubescens
A_pediades
P_lycii
A_fumigatus
consphyA
A_nidulans
A_ficuum_NRRL3135
A_terreus
T_thermo
T_lanuginosa
M_thermophila
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TRYSSLVSAD ELPFVRASGS DRWATANNW TAGFALA. . . 
QRYSFLVSKE NLPFVRASSS NRWDSATNW TEGFSAA. . . 
TRYSTLFEGG DVPFVRAAGD QRWDSSTNW TAGFGDA. . , 
QRYKAL . ARS WPFIRASGS DRVIASGEKF IEGFQQAKLA 
RRYKAL.ARK IVPFIRASGS DRVIASAEKF IEGFQSAKLA 
RRYKNL.ARK NTPFIRASGS DRWASAEKF INGFRKAQLH 
QRYESL.TRN IVPFIRSSGS SRVIASGKKF IEGFQSTKLK 
ERYNAL - TRH INPFVRATDA SRVHESAEKF VEGFQTARQD 
NHYKSL.ARN AVPFVRCSGS DRVTASGRLF IEGFQSAKVL 
HRYREQ. ARE IVPFVRAAGS ARVIASAEFF NRGFQDAKDR 
RRYRAL. ARK SIPFVRTAGQ DRWHSAENF TQGFHSALLA 
QRYSSLIDSD TLPFVRASGQ ERVTASAENF TTGFYSALSA 
QTE 



SSNSIT 

... * SHHVLN 
. . . • SGETVL 
DPGA . TNRAA 
DPGSQPHQAS 
DHGS . . KRAT 
DPRAQPGQSS 
DHHANPHQPS 
DPHSDKHDAP 
DPRSNKDQAE 
DRGSTVRPTL 
DKNPPSSLPR 



P_involtus_A1
P_involtus_A2
T_pubescens
A_pediades
P_lycii
A_fumigatus
consphyA
A_nidulans
A_ficuum_NRRL3135
A_terreus
T_thermo
T_lanuginosa
M_thermophila



177 
201 

PKLNLILPQT 
PKLDLILPQT 
PVLSVIISEA 
PIIiFVILSES 
PTLQWLQEE 
PAISVIIPES 
PVIDVIIPEG 
PWNVIIPEI 
PKIDWISEA 
PRVDVAIPEG 
PTINVIIEEG 
PVINVTISEE 
PYDMWIPET 
P.EMVIISEE 



G. .NDTLEDN 
G. . NDTLEDN 
G. . NDTLDDN 
L. . NDTLDDA 
G. .NCTLCNN 
ETFNNTLDHG 
SGYNNTLDHG 
DGFNNTLDHS 
SSSNNTLDPG 
SAYNNTLEHS 
PSYNNTLDTG 
TGSNNTLDGL 
AGANNTIiHND 
PTANNTMHHG 



MCPAAGD . . • 
MCPAAGE . . . 
MCPAAGD. . . 
MCPNAGS . . . 
MCPNEVD. . . 
VCTKFEA. . . 
TCTAFED . . . 
TCVSFEN . . . 

TCTVFED 

LCTAFES . . . 
SCPVFED . . . 
TCPAAEE . . . 
LCTAFEEGPY 
XjCRSFED 



. SDPQVNA 
• SDPQVDA 
. SDPQVNQ 
♦SDPQTGI 
.GD.ESTT 
SQLGDEVAAN 
SELGDDVEAN 
DERADEIEAN 
SELADTVEAN 
STVGDDAVAN 
SSGGHDAQEK 
AP.DPTQPAE 
STIGDDAQDT 
STTGDQAQAE 



217 
250 

WIiAVAFPSIT 
WLASAFPSVT 
WIiAQFAPPMT 
WTSIYGTPIA 
WLGVFAPNIT 
FTALFAPDIR 
FTALFAPAIR 
FTAIMGPPIR 
FTATFVPSIR 
FTAVFAPAIA 
FAKQFAPAIL 
FLQVFGPRVL 
YIiS TFAGP IT 
FIAATFPPIT 



P_involtus_A1
P_involtus_A2
T_pubescens
A_pediades
P_lycii
A_fumigatus
consphyA
A_nidulans
A_ficuum_NRRL3135
A_terreus
T_thermo
T_lanuginosa
M_thermophila



218 
251 

ARIiNAAAPSV 
AQLNAAAPGA 
ARXjNAGAPGA 
NRLNQQAPGA 
ARLNAAAPSA 
ARAEKHLPGV 
ARLEADLPGV 
KRLENDLPGI 
QRLENDLSGV 
QRLEADLPGV 
EKIKDHLPGV 
KKITKHMPGV 
ARVNANLPGA 
ARLNAGFKGV 
Q 



252 
300 

NLTDTDAFNL VSLCAFLTVS KEKK s 

NLTDADAFNL VSLCPFMTVS KEQK S 

NLTDTDTYNL LTLCPFETVA TERR S 

NITAADVSNL IPLCAFETIV KETP S 

NLSDSDALTL MDMCPFDTLS SGNA S 

TLTDEDWSL MDMCSFDTVA RTSD. .ASQ LS 

TLTDEDWYL MDMCPFETVA RTSD . . ATE LS 

KLTNENVI YL MDMCSFDTMA RTAH . . GTE LS 

TLTDTEVTYL MDMCSFDTIS TSTV. .DTK LS 

QLSTDDWNL MAMCPFETVS LTDD..AHT LS 

DLAVSDVPYL MDLCPFETLA RNHT. .DT LS 

NLTLEDVPLF MDLCPFDTVG SDPVLFPRQ LS 

NLTDADTVAL MDLCPFETVA SSSSDPATAD AGGGNGRPLS 
TLSNTDVLSL MDLCPFDTVA YPLSSLTTTS SVSGGGK LS 



P_involtus_A1 DFCTLFEGIP GSFEAFAYGG DLDKFYGTGY GQELGPVQGV GYVNELIARL
P_involtus_A2 DFCTLFEGIP GSFEAFAYAG DLDKFYGTGY GQALGPVQGV GYINELLARL
T_pubescens EFCDIYEELQ AE.DAFAYNA DLDKFYGTGY GQPLGPVQGV GYINELIARL
A_pediades PFCNLFT..P EEFAQFEYFG DLDKFYGTGY GQPLGPVQGV GYINELLARL
P_lycii PFCDLFT..A EEYVSYEYYY DLDKYYGTGP GNALGPVQGV GYVNELLARL
A_fumigatus PFCQLFT..H NEWKKYNYLQ SLGKYYGYGA GNPLGPAQGI GFTNELIARL

FiV, OP 
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consphyA PFCALFT . 

A_nidulans PFCAIFT .

A_ficuum_NRRL3135 PFCDLFT .

A_terreus PFCDLFT .

T_thermo PFCALST .

T_lanuginosa PFCHLFT .

M_thermophila PFCRLFS .

PFCSLFT 
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.H DEWRQYDYLQ SLGKYYGYGA GNPLGPAQGV GFANELIARL 
.E KEWLQYDYLQ SLSKYYGYGA GSPLGPAQGI GFTNELIARL 
.H DEWINYDYLQ SLKKYYGHGA GNPLGPTQGV GYANELIARL 
.A TEWTQYNYLL SLDKYYGYGG GNPLGPVQGV GWANELMARL 
.Q EEWQAYDYYQ SLGKYYGNGG GNPLGPAQGV GFVNELIARM 
• A DDWMAYDYYY TLDKYYSHGG GSAFGPSRGV GFVNELIARM 
.E SEWRAYDYLQ SVGKWYGYGP GNPLGPTQGV GFVNELLARL 
A SDWTIYDYLQ SLGKYYGFGP GNSLAATQGV GYVNELIARL 



P_involtus_A1
P_involtus_A2
T_pubescens
A_pediades
P_lycii
A_fumigatus
consphyA
A_nidulans
A_ficuum_NRRL3135
A_terreus
T_thermo
T_lanuginosa
M_thermophila



301 
351 

TNS.AVRDNT 
TNS . AVNDNT 
TAQ . NVSDHT 
TEM.PVRDNT 
TGQ.AVRDET 
TRS . PVQDHT 
TRS . PVQDHT 
TQS . PVQDNT 
THS . PVHDDT 
TRA. PVHDHT 
THS . PVQDYT 
TGNLPVKDHT 
A.GVPVRDGT 
I RAPWDHT 



QTNRTLDASP 
QTNRTLDAAP 
QTNSTLDSSP 
QTNRTLDSSP 
QTNRTLDSDP 
STNSTLVSNP 
STNHTLDSNP 
STNHTLDSNP 
SSNHTLDSSP 
CVNNTLDASP 
TVNHTLDSNP 
TVNHTLDDNP 
STNRTLDGDP 
TTNSTLDGDE 



VTFFLNKTFY 
DTFPLNKTMY 
ETFPLNRTLY 
LTFPLDRSIY 
ATFPLNRTFY 
ATFPLNATMY 
ATFPLNATLY 
ATFPLDRKLY 
ATFPLKSTLY 
ATFPLNATLY 
ATFPLNATLY 
ETFPLDAVLY 
RTFPLGRPLY 
KTFPLNRTVY 



ADFSHDNLMV 
ADFSHDNLMV 
ADFSHDNQMV 
ADLSHDNQMI 
ADFSHDNTMV 
VDFSHDNSMV 
ADFSHDNSMI 
ADFSHDNSMI 
ADFSHDNGII 
ADFSHDSNLV 
ADFSHDNTMT 
AD FSHDNTMT 
ADFSHDNDMM 
ADFSHDNDMM 



349 
400 

AVFSAMGLFR 
AVFSAMGLFR 
AIFSAMGLFN 
AIFSAMGLFN 
PIFAALGLFN 
SIFFALGLYN 
SIFFALGLYN 
SIFFAMGLYN 
SILFALGLYN 
SIFWALGLYN 
SIFAALGLYN 
GIFSAMGLYN 
GVLGALGAYD 
NILTALRIFE 





350 








383 






401 










450 


P_involtus_A1


QPAPLSTSVP 






WRTSSLVPFS 


GRMWERLSC 




P_involtus_A2


QSAPLSTSTP 


DPNR. . 


. . .T 


WLTSSWPFS 


ARMAVERLSC 




T_pubescens


QSAPLDPTTP 


DPAR. . 


. . .T 


FLVKKIVPFS 


ARMWERLDC 




A_pediades


QSSPLDPSFP 






WVTSRLTPFS 


ARMVTERLLC 


QRDGTGSGGP 


P_lycii 


ATA . LDPLKP 




. . .L 


WVDSKLVPFS 


GHMTVEKLAC 




A_fumigatus 


GTEPLSRTSV 


ESAKE - 


. LDG 


YSASWWPFG 


ARAYFETMQC 




consphyA 


GTAPLSTTSV 


ESIEE. 


.TDG 


YSASWTVPFG 


ARAYVEMMQC 




A_nidulans


GTQPLSMDSV 


ESIQE. 


.MDG 


YAASWTVPFG 


ARAYFELMQC 




A_ficuum_NRRL3135


GTKPLSTTTV 


ENITQ. 


.TDG 


FSSAWTVPFA 


SRLYVEMMQC 




A_terreus


GTAPLSQTSV 


ESVSQ. 


• TDG 


YAAAWTVPFA 


ARAYVEMMQC 




T_thermo


GTAKLSTTEI 


KSIEE. 


.TDG 


YSAAWTVPFG 


GRAYIEMMQC 




T_lanuginosa 


GTKPLSTSKI 


QPPTGAAADG 


YAASWTVPFA 


ARAYVELLRC 


ETETS SEEEE 


M_thermophila 


GVPPLDKTAR 


RDPEE - 


• LGG 


YAASWAVPFA 


ARIYVEKMRC 


SGGGGGGGGG 




HISPMDQTGD 


DGVKE 


RDL 


FKVSWAVPFA 


GRVYFEKMVC 


DADGDGKIDS 



NTTIPTNYG 



384 

451 

P_involtus_A1 FGT
P_involtus_A2 AGT
T_pubescens GGA
A_pediades SRIMRNGNVQ
P_lycii .SGK
A_fumigatus K..S...EKE
consphyA Q..A...EKE
A_nidulans E KKE
A_ficuum_NRRL3135 Q..A...EQA
A_terreus R..A...EKE
T_thermo D..D...SDE



TKVRVLVQDQ 
TKVRVLVQDQ 
QSVRLLVNDA 
TFVRILVNDA 
EAVRVLVNDA 
PLVRALINDR 
PLVRVLVNDR 
PLVRVLVNDR 
PLVRVLVNDR 
PLVRVLVNDR 
PWRVLVNDR 



VQPLEFCGGD 
VQPLEFCGGD 
VQPLAFCGAD 
LQPLKFCGGD 
VQPLEFCGG. 
WPLHGCDVD 
WPLHGCAVD 
WPLHGCAVD 
WPLHGCPVD 
VMPLHGCPTD 
WPLHGCEVD 



RNGLCTLAKF 
QDGLCALDKF 
TSGVCTLDAF 
MDSLCTLEAF 
VDGVCELSAF 
KLGRCKLNDF 
KLGRCKRDDF 
KFGRCTLDDW 
ALGRCTRDSF 
KLGRCKRDAF 
SLGRCKRDDF 



425 
500 

VESQTFARSD 
VESQAYARSG 
VESQAYARND 
VESQKYARED 
VESQTYAREN 
VKGLSWARSG 
VEGLSFARSG 
VEGLNFARSG 
VRGLSFARSG 
VAGLS FAQAG 
VRGLSFARQG 



47A 
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T_lanuginosa 
M_thermophila



P_involtus_A1
P_involtus_A2
T_pubescens
A_pediades
P_lycii
A_fumigatus
consphyA
A_nidulans
A_ficuum_NRRL3135
A_terreus
T_thermo
T_lanuginosa
M_thermophila



E. .G...EDE PFVRVLVNDR WPLHGCRVD RWGRCRRDEW IXBLTFARQG 
E. .GRQEKDE EMVRVLVNDR VMTLKGCGAD ERGMCTLERP IESMAFARGN 
D EAQK EliVRILVNDR VMRLNGCDAD EQGRCGLEKF VE5MEFARRG 

426 439 
501 514 
GAGDFEKCFA TSA 
GAGDFEKCLA TTV 
GEGDFEKCFA T. . 
GQGDFEKCFD . . . 
GQGDFAKCGF VPSE 
. . GNWGECFS . . . 
. . GNWAECFA *♦ . 
. . GNWKTCFT L. . 
. . GDWAECFA . . . 
. .GNWADCF. . . . 
. . GNWEGCYA ASE 
. . GHWDRCF . . . . 
. . GKWDLCFA . . , 
GEWEECFV 
R 






SEQUENCE LISTING

<212> DNA
<213> Cladorrhinum foecundissimum

<220>
<221> intron
<222> (71)..(126)

<220>
<221> CDS
<222> (20)..(70)

<220>
<221> CDS
<222> (127)..(1563)

<220>
<221> sig_peptide
<222> (20)..(64)

<400> 1

aagcttgggc aaactcatc atg ctc atc ttg atg att cca ctg ttc agc tac 52
Met Leu Ile Leu Met Ile Pro Leu Phe Ser Tyr
1 5 10

ctg gct gct gct tct ctg tgggttcatc ctttgcccct gtctcgatgt 100
Leu Ala Ala Ala Ser Leu
15

taaaatacta aacatatttc accaga cgt gta ctc tcc cct cag cca gtg tcc 153
Arg Val Leu Ser Pro Gln Pro Val Ser
20 25

tgt gac agc ccg gag ctt ggt tac caa tgc gac cag cag aca acg cac 201
Cys Asp Ser Pro Glu Leu Gly Tyr Gln Cys Asp Gln Gln Thr Thr His
30 35 40

acc tgg ggt caa tac tca ccc ttc ttc tct gtc ccg tca gag atc tcc 249
Thr Trp Gly Gln Tyr Ser Pro Phe Phe Ser Val Pro Ser Glu Ile Ser
45 50 55

cct tcc gtt cct gat ggc tgc cgc ctc acc ttc gcc caa gtt ctc tcc 297
Pro Ser Val Pro Asp Gly Cys Arg Leu Thr Phe Ala Gln Val Leu Ser
60 65 70

cgc cac ggc gcc cgc ttc cca acc ccg ggt aaa gcc gcc gcc atc tcc 345
Arg His Gly Ala Arg Phe Pro Thr Pro Gly Lys Ala Ala Ala Ile Ser
75 80 85 90

gct gtc ctc acc aaa atc aaa acc tct gcc acc tgg tac ggt tcc gac 393
Ala Val Leu Thr Lys Ile Lys Thr Ser Ala Thr Trp Tyr Gly Ser Asp
95 100 105

ttt cag ttc atc aag aac tac gac tat gta ctt ggc gta gac cac ctg 441
Phe Gln Phe Ile Lys Asn Tyr Asp Tyr Val Leu Gly Val Asp His Leu
110 115 120

acc gcg ttc ggc gag caa gaa atg gtc aac tcc ggc atc aag ttc tac 489
Thr Ala Phe Gly Glu Gln Glu Met Val Asn Ser Gly Ile Lys Phe Tyr
125 130 135

cag cgc tac tcc tcc ctc atc cag aca gaa gac tcg gat acg ctc ccc 537
Gln Arg Tyr Ser Ser Leu Ile Gln Thr Glu Asp Ser Asp Thr Leu Pro
140 145 150

ttc gtc cgc gcc tct ggc cag gaa cgc gtc atc gcc tcc gcc gag aac 585
Phe Val Arg Ala Ser Gly Gln Glu Arg Val Ile Ala Ser Ala Glu Asn
155 160 165 170

ttc acc acc ggc ttc tac tcg gcc ctc tca gcc gac aag aac cct cct 633
Phe Thr Thr Gly Phe Tyr Ser Ala Leu Ser Ala Asp Lys Asn Pro Pro
175 180 185

tcc tcc tta cca aga cca gaa atg gtc atc att tct gag gag cca aca 681
Ser Ser Leu Pro Arg Pro Glu Met Val Ile Ile Ser Glu Glu Pro Thr
190 195 200

gcc aac aac acc atg cac cac ggc ctc tgc cgc tcc ttt gaa gat tcc 729
Ala Asn Asn Thr Met His His Gly Leu Cys Arg Ser Phe Glu Asp Ser
205 210 215

acc acc ggc gac caa gcc caa gcg gaa ttc atc gcc gcc acc ttc cca 777
Thr Thr Gly Asp Gln Ala Gln Ala Glu Phe Ile Ala Ala Thr Phe Pro
220 225 230

ccc atc acc gcc cgt ctc aac gcc caa ggt ttc aaa ggc gtc acc ctc 825
Pro Ile Thr Ala Arg Leu Asn Ala Gln Gly Phe Lys Gly Val Thr Leu
235 240 245 250

tcc aac acc gac gtc cta tca cta atg gac ctc tgc ccc ttt gac acc 873
Ser Asn Thr Asp Val Leu Ser Leu Met Asp Leu Cys Pro Phe Asp Thr
255 260 265

gtc gcc tac ccc ctt tcc tcc ctc acc acc acc tct tcc gtt tct gga 921
Val Ala Tyr Pro Leu Ser Ser Leu Thr Thr Thr Ser Ser Val Ser Gly
270 275 280

ggc ggc aag tta tcc ccc ttc tgc tct ctt ttc act gcc agc gac tgg 969
Gly Gly Lys Leu Ser Pro Phe Cys Ser Leu Phe Thr Ala Ser Asp Trp
285 290 295

aca atc tac gat tac ctc cag tcc cta ggg aaa tac tac ggt ttc ggc 1017
Thr Ile Tyr Asp Tyr Leu Gln Ser Leu Gly Lys Tyr Tyr Gly Phe Gly
300 305 310

ccc ggt aat tcc cta gct gcc acc cag ggg gta ggg tac gtc aac gag 1065
Pro Gly Asn Ser Leu Ala Ala Thr Gln Gly Val Gly Tyr Val Asn Glu
315 320 325 330

ctt atc gcc cgc ttg atc cgt gct ccc gtc gta gat cac acg acg acc 1113
Leu Ile Ala Arg Leu Ile Arg Ala Pro Val Val Asp His Thr Thr Thr
335 340 345

aac tct act ctt gat ggc gac gaa aaa acg ttt ccg ttg aac aga acg 1161
Asn Ser Thr Leu Asp Gly Asp Glu Lys Thr Phe Pro Leu Asn Arg Thr
350 355 360

gtg tat gcg gat ttt tcc cat gat aat gat atg atg aat atc ctg act 1209
Val Tyr Ala Asp Phe Ser His Asp Asn Asp Met Met Asn Ile Leu Thr
365 370 375

gct ttg cgg ata ttc gag cat atc agt ccg atg gat aac acc act atc 1257
Ala Leu Arg Ile Phe Glu His Ile Ser Pro Met Asp Asn Thr Thr Ile
380 385 390

ccg acc aac tat ggc cag aca gga gat gac ggg gtg aag gaa agg gat 1305
Pro Thr Asn Tyr Gly Gln Thr Gly Asp Asp Gly Val Lys Glu Arg Asp
395 400 405 410

ttg ttc aag gtt agt tgg gcg gtg ccc ttt gct ggg agg gtg tac ttt 1353
Leu Phe Lys Val Ser Trp Ala Val Pro Phe Ala Gly Arg Val Tyr Phe
415 420 425

gag aaa atg gtt tgt gat gcg gat ggg gat ggc aag att gat agt gat 1401
Glu Lys Met Val Cys Asp Ala Asp Gly Asp Gly Lys Ile Asp Ser Asp
430 435 440

gag gct cag aaa gag ttg gtg agg att ttg gtt aat gat cgg gtg atg 1449
Glu Ala Gln Lys Glu Leu Val Arg Ile Leu Val Asn Asp Arg Val Met
445 450 455

aga ttg aat ggg tgt gat gct gat gaa cag ggt agg tgt gga ttg gag 1497
Arg Leu Asn Gly Cys Asp Ala Asp Glu Gln Gly Arg Cys Gly Leu Glu
460 465 470

aag ttt gtg gag agt atg gag ttt gcg agg aga ggg ggg gag tgg gag 1545
Lys Phe Val Glu Ser Met Glu Phe Ala Arg Arg Gly Gly Glu Trp Glu
475 480 485 490

gag agg tgt ttt gtt tag ctctaga 1570
Glu Arg Cys Phe Val
495
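
The feature entries of SEQ ID NO: 1 (two CDS segments separated by an intron, plus a signal peptide) fix the length of the encoded protein by simple arithmetic. The sketch below, offered only as an illustration, joins the CDS coordinates and counts codons. The helper names `span_length` and `coding_summary` are hypothetical, and the coordinates are copied from the `<222>` entries of the listing.

```python
# Feature coordinates copied from the <221>/<222> entries of SEQ ID NO: 1
# (1-based, inclusive). Helper names are illustrative, not part of the listing.
CDS_SEGMENTS = [(20, 70), (127, 1563)]
INTRON = (71, 126)
SIG_PEPTIDE = (20, 64)


def span_length(span):
    """Length in nucleotides of a 1-based inclusive (start, end) span."""
    start, end = span
    return end - start + 1


def coding_summary(cds_segments, sig_peptide):
    """Return nucleotide and codon counts implied by the feature table."""
    coding_nt = sum(span_length(s) for s in cds_segments)
    signal_nt = span_length(sig_peptide)
    if coding_nt % 3 != 0 or signal_nt % 3 != 0:
        raise ValueError("feature length is not a multiple of three")
    codons = coding_nt // 3
    return {
        "coding_nt": coding_nt,
        "codons_incl_stop": codons,
        "residues": codons - 1,  # the final codon of the CDS (tag) is the stop
        "signal_residues": signal_nt // 3,
    }


summary = coding_summary(CDS_SEGMENTS, SIG_PEPTIDE)
```

Under these coordinates the joined coding region spans 1488 nucleotides, that is 496 codons including the stop codon, which agrees with the 495 residues numbered in the translation; the signal peptide feature covers the first 15 codons.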



<210> 2 

<211> 495 

<212> PRT
<213> Cladorrhinum foecundissimum

<400> 2

Met Leu Ile Leu Met Ile Pro Leu Phe Ser Tyr Leu Ala Ala Ala Ser
1 5 10 15

Leu Arg Val Leu Ser Pro Gln Pro Val Ser Cys Asp Ser Pro Glu Leu
20 25 30

Gly Tyr Gln Cys Asp Gln Gln Thr Thr His Thr Trp Gly Gln Tyr Ser
35 40 45

Pro Phe Phe Ser Val Pro Ser Glu Ile Ser Pro Ser Val Pro Asp Gly
50 55 60

Cys Arg Leu Thr Phe Ala Gln Val Leu Ser Arg His Gly Ala Arg Phe
65 70 75 80

Pro Thr Pro Gly Lys Ala Ala Ala Ile Ser Ala Val Leu Thr Lys Ile
85 90 95

Lys Thr Ser Ala Thr Trp Tyr Gly Ser Asp Phe Gln Phe Ile Lys Asn
100 105 110

Tyr Asp Tyr Val Leu Gly Val Asp His Leu Thr Ala Phe Gly Glu Gln
115 120 125

Glu Met Val Asn Ser Gly Ile Lys Phe Tyr Gln Arg Tyr Ser Ser Leu
130 135 140

Ile Gln Thr Glu Asp Ser Asp Thr Leu Pro Phe Val Arg Ala Ser Gly
145 150 155 160

Gln Glu Arg Val Ile Ala Ser Ala Glu Asn Phe Thr Thr Gly Phe Tyr
165 170 175

Ser Ala Leu Ser Ala Asp Lys Asn Pro Pro Ser Ser Leu Pro Arg Pro
180 185 190

Glu Met Val Ile Ile Ser Glu Glu Pro Thr Ala Asn Asn Thr Met His
195 200 205

His Gly Leu Cys Arg Ser Phe Glu Asp Ser Thr Thr Gly Asp Gln Ala
210 215 220

Gln Ala Glu Phe Ile Ala Ala Thr Phe Pro Pro Ile Thr Ala Arg Leu
225 230 235 240

Asn Ala Gln Gly Phe Lys Gly Val Thr Leu Ser Asn Thr Asp Val Leu
245 250 255

Ser Leu Met Asp Leu Cys Pro Phe Asp Thr Val Ala Tyr Pro Leu Ser
260 265 270

Ser Leu Thr Thr Thr Ser Ser Val Ser Gly Gly Gly Lys Leu Ser Pro
275 280 285

Phe Cys Ser Leu Phe Thr Ala Ser Asp Trp Thr Ile Tyr Asp Tyr Leu
290 295 300

Gln Ser Leu Gly Lys Tyr Tyr Gly Phe Gly Pro Gly Asn Ser Leu Ala
305 310 315 320

Ala Thr Gln Gly Val Gly Tyr Val Asn Glu Leu Ile Ala Arg Leu Ile
325 330 335

Arg Ala Pro Val Val Asp His Thr Thr Thr Asn Ser Thr Leu Asp Gly
340 345 350

Asp Glu Lys Thr Phe Pro Leu Asn Arg Thr Val Tyr Ala Asp Phe Ser
355 360 365

His Asp Asn Asp Met Met Asn Ile Leu Thr Ala Leu Arg Ile Phe Glu
370 375 380

His Ile Ser Pro Met Asp Asn Thr Thr Ile Pro Thr Asn Tyr Gly Gln
385 390 395 400

Thr Gly Asp Asp Gly Val Lys Glu Arg Asp Leu Phe Lys Val Ser Trp
405 410 415

Ala Val Pro Phe Ala Gly Arg Val Tyr Phe Glu Lys Met Val Cys Asp
420 425 430

Ala Asp Gly Asp Gly Lys Ile Asp Ser Asp Glu Ala Gln Lys Glu Leu
435 440 445

Val Arg Ile Leu Val Asn Asp Arg Val Met Arg Leu Asn Gly Cys Asp
450 455 460

Ala Asp Glu Gln Gly Arg Cys Gly Leu Glu Lys Phe Val Glu Ser Met
465 470 475 480

Glu Phe Ala Arg Arg Gly Gly Glu Trp Glu Glu Arg Cys Phe Val
485 490 495
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